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Description 

[0001] The present invention relates generally to methods and compositions for analyzing nucleic acid molecules, 
and more specifically to tags which may be utilized in a wide variety of nucleic acid reactions, wherein separation of 
5 nucleic acid molecules based on size is required. 

BACKGROUND OF THE INVENTrON 

[0002] Detection and analysis of nucleic acid molecules are among the most important techniques in biology. Such 
10 techniques are at the heart of molecular biology and play a rapidly expanding role in the rest of biology. 

[0003] Generally, one type of analysis of nucleic acid reactions involves separation of nucleic acid molecules based 
on length. For example, one widely used technique, polymerase chain reaction (PGR) (see, U.S. Patent Nos. 4,683,195, 
4,683,202, and 4,800,1 59) has become a widely utilized technique to both identify sequences present in a sample and 
to synthesize DNA molecules for further manipulation. 
15 [0004] Briefly, in PGR, DNA sequences are amplified by enzymatic reaction that synthesizes new DNA strands in 
either a geometric or linear fashion. Following amplification, the DNA sequences must be detected and identified. 
Because of non-specific amplifications, which would othenwise confuse analysis, or the need for purify, the PGR reaction 
products are generally subjected to separation prior to detection. Separation based on the size (i.e.. length) of the 
products yields the most useful information. The method giving the highest resolution of nucleic acid molecules is 
20 electrophoretic separation. In this method, each individual PGR reaction is applied to an appropriate get and subjected 
to a voltage potential. The number of samples that can be processed Is limited by the number of wells in the get. On 
most gel apparatus, from approximately 10 to 64 samples can be separated in a single gel. Thus, processing large 
numbers of samples is both labor and material intensive. 

[0005] Electrophoretic separation must be coupled with some detection system in order to obtain data. Detection 
25 systems of nucleic acids commonly, and almost exclusively, utilize an intercalating dye or radioactive label, and less 
fi^equently, a nonradioactive label. Intercalating dyes, such as ethidium bromide, are simple to use. The dye is included 
in the gel matrix during electrophoresis or. following electrophoresis, the gel is soaked in a dye-containing solution. 
The dye can be directly visualized in some cases, but more often, and for ethidium bromide in particular, is excited by 
light (e.g., UV) to fluoresce. In spite of this apparent ease of use, such dyes have some notable disadvantages. First. 
30 the dyes are insensitive and there must be a large mass amount of nucleic acid molecules in order to visualize the 
products. Second, the dyes are typically mutagenic or carcinogenic. 

[0006] A more sensitive detection technique than dyes uses a radioactive (or nonradioactive) label. Typically, either 
* a radiolabeled nucleotide or a radiolabeled primer is included In the PGR reaction. Following separation, the radiolabel 
is "visualized" by autoradiography. Although more sensitive, the detection suffers from film limitations, such as reci- 

35 procity failure and non-linearify. These limitations can be overcome by detecting the label by phosphor image analysis. 
However, radlolabels have safely requirements, increasing resource utilization and necessitating specialized equip- 
ment and personnel training. For such reasons, the use of nonradioactive labels has been increasing in popularify. In 
such systems, nucleotides contain a label,; such as a fluorophore, biotin or digoxin, which can be detected by an 
antitx)dy or other molecule (e.g., other member of a ligand pair) that is labeled with an enzyme reactive with a chro- 

40 mogenic substrate. These systems do not have the safefy concems as described al)ove, but use components that are 
often labile and may yield nonspecific reactions, resulting in high background {i.e., low signal-to-noise ratio). 
[0007] The present invention provides novel compositions and methods which may be utilized in a wide variety of 
nucleic acid reactions, and further provides other related advantages. 

[0008] US-A-51 1 8605 relates to methods for assaying a nucleic acid analytic. WO-A-95041 60 relates to a tag reagent 

45 and assay method. 

SUMMARY OF THE INVENTION 

[0009] Briefly stated, the present invention provides compositions and methods which may be utilized in a wide 
50 variety of llgand pair reactions wherein separation of molecules of interest, such as nucleic acid molecules, based on 
size is required. Representative examples of methods which may be enhanced given the disclosure provided herein 
include PGR. differential display. RNA fingerprinting. PGR-SSGP. ollgo litations assays, nuclease digestion methods 
(e.g.. exo- and endo- nuclease based assays), and dideoxy fingerprinting. The methods described herein may be 
utilized in a wide an-ay of fields, including, for example, in the development of clinical or research-based diagnostics, 
55 the determination of polymorphisms, and the development of genetic maps. 

[0010] Within one aspect of the present invention, there is provided a compound of the formula: 
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wherein, 

T'^sjs an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and fluoride, 
and optional atoms selected from oxygen, nitrogen^ sulfur,~phosph~orus "and iodine; ~~ 
L is an organic group which allows a unique T"«-conlalning moiety to be cleaved from the remainder of the com- 
pound, wherein the T^^^ontaining moiety comprises a functional group which supports a single Ionized charge 
state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 
add; and 

X is a functional group selected from phosphoramidite and H-phosphonate. 



[001 1] In another aspect, the present invention provides a method fordelemilning the presence of a single nucleotide 
polymorphism in a nucleic acid target comprising: 

a) amplifying a sequence of a nucleic acid target containing a single nucleotide polymorphism; 

b) generating a single strand form of the target; 

c) combining a tagged nucleic acid probe having the structure T^s.l.x' with the amplified target nucleic acid mol- 
ecules under conditions and for a time sufficient to permit hybridization of said tagged nucleic acid probe to com- 
plementary amplified selected target nucleic add molecules, wherein said tag is correlative with a particular single 
nudeotide polymorphism and is detectable by spectrometry or potentiometry; 

d) separating unhybrldized tagged probe from hybridized tagged probe by a sizing methodology; 

e) cleaving said tag from said probe: and 

f) detecting said tag by spectrometry or potentiometry, and detenmining the presence of said single nudeotide 
polymorphism. 



[001 2] In another aspect, the present invention provides a method for determining the presence of a single nudeotide 
polymorphism In a nudeic acid target comprising: 

30 

a) amplifying a sequence of a nucleic acid target containing. a single nudeotide polymorphism; 

b) combining a tagged nudeic acid primer having the stmcture T^s.l-X' with the amplified target nudeic acid 
molecules under conditions and for a time sufficient to pemiit annealing of said tagged nudeic acid primer to 
complementary amplified selected target nucleic acid molecules, wherein the oligonudeotide primer has a 3'-most 

35 base complementary to the wildtype sequence or the single nudeotide polymorphism, wherein said tag is correl- 

ative with a particular single nudeotide polymorphism and is detectable by spectrometry or potentiometry; 

c) extending the primer wherein a complementary strand to the target is synthesized when the 3'-most baseof the 
primer is complementary to the target; 

d) separating unextended tagged primer from extended tagged primer by a sizing methodology; 
40 e) cleaving said tag from said primers or extended primers; and 

f) detecting said tag by specb-ometry or potentiometry. and determining therefrom the presence of said single 
nudeotide polymorphism. 



[0013] In another aspect, the present invention provides a method for detenmining the quantity of a specific mRNA 
45 molecule in a nucleic acid population comprising: 



a) converting an RNA population into a cDNA population; 

b) adding a single strand nucleic acid (Intemal standard) containing a plurality of single nucleotide polymorphisms, 
that is otherwise identical to said cDNA target; 

50 c) amplifying a specific sequence of said cDNA target; 

d) coamplifying the internal standard, wherein said intemal standard is the same length as the cD^4A amplicon; 

e) generating a single strand form of the target; 

f) combining a set of tagged nudeic acid probes having the stmcture T^-L-X' with the amplified target cDNA and 
amplified intemal standard under conditions and for a time suffident to permit hybridization of said tagged nucleic 

55 acid probe to complementary selected target cDNA and intemal standard sequences, wherein said tag is correlative 

with a particular cDNA sequence and a second tag is correlative with the internal standard, and is detectable by 
spectrometry or potentiometry; 

g) separating unhybridized tagged probe from hybridized tagged probe by a sizing methodology; 
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h) cleaving said tag from said probes; 

i) detecting said tags by spectrometry or potentiometry; and 

j) taking the ratio of tag con-elated to cDNA to tag correlated with the internal standard, and determining therefrom 
the quantity of said cDNA. thereby determining the quantity of the specific mRNA in a nucleic acid population. 

5 

[0014] In another aspect, the present invention provides a method for determining the quantity of a single nucleotide 
polyn^orphlsm In a~nudeicacidtarget~compris ' 

a) amplifying a sequence of a nucleic acid target containing a single nucleotide polymorphism; 

10 b) generating a single strand form of the target; 

c) combining a tagged nudeic acid probe wherein the tagged nucleic acid has the structure T™s-L-X' with the 
amplified target nucleic acid molecules under conditions and for a time sufficient to permit hybridization of said 
tagged nudeic acid probe to complementary amplified selected target nudeic acid molecules, wherein said tag is 
correlative with a particular single nudeotide polymorphism and is detectable by spectrometry or potentiometry; 

15 d) separating unhybridized tagged probe from hybridized tagged probe by a sizing methodology; 

e) deaving said tag from said probes; 

f) detecting said tags by spectrometry or potentiometry; and 

j) taking the ratio of tag correlated to the wild type polymorphism to the tag correlated with the mutant polymorphism, 
and determining therefrom the quantity of said polymorphism. 

20 

[0015] In the above four methods, the tagged nudeic acid has the staicture P^-L-X'. where X* is the nudeic acid, 
and and L are as defined above. 

[001 6] Within one aspect of the present invention, methods are provided for determining the identity of a nudeic acid 
molecule, comprising the steps of (a) generating tagged nudeic acid molecules from one or more selected target 
25 nucleic acid molecules, wherein a tag is correlative with a particular nudeic acid fragment and detectable by non- 
fluorescent spectrometry or potentiometry, (b) separating the tagged fragments by size, (c) cleaving the tags from the 
tagged fragments, and (d) detecting tags by non-fluorescent spectrometry or potentiometry, and therefrom determining 
the identity of the nudeic acid molecules. 

[0017] Within a related aspect of the invention, methods are provided for detecting a selected nucleic acid molecule, 
30 comprising the.steps of (a) combining tagged nudeic acid probes with target nudeic add molecules under conditions 
and for a time sufficient to permit hybridization of a tagged nudeic acid probe to a complementary selected target 
nudeic add sequence, wherein a tagged nucleic acid probe is detectable by non-fluorescent spectrometry or potenti- 
ometry, (b) altering the size of hybridized tagged probes, unhybridized probes or target molecules, or the probe:target 
hybrids, (c) separating the tagged probes by size, (d) deaving tags from the tagged probes, and (e) detecting the tags 
35 by non-fluorescent spectrometry or potentiometry. and therefrom detecting the selected nucleic acid molecule. 

[0018] Within further aspects methods are provided for genotyping a selected organism, comprising the steps of (a) 
generating tagged nudeic acid molecules from a selected target molecule, wherein a tag is correlative with a particular 
fragment and may be detected by non-fluorescent spectrometry or potentiometry. (b) separating the tagged molecules 
by sequential length, (c) deaving the tag from the tagged molecule, and (d) detecting the tag by non-fluorescent spec- 
ie trometry or potentiometry, and thereft-om detennining the genotype of the organism, 

[0019] Within another aspect, methods are provided for genotyping a selected organism, comprising the steps of (a) 
combining a tagged nudeic acid molecule with a seleded target molecule under conditions and for a time suffident to 
pennit hybridization of the tagged molecule to the target molecule, wherein a tag Is correlative with a particular fragment 
and may be detected by non-fluorescent spectrometry or potentiometry, (b) separating the tagged fragments by se- 
45 quential length, (c) deaving the tag from the tagged fragment, and (d) detecfing the tag by non-fluorescent spectrometry 
or potentiometry. and therefrom determining the genotype of the organism. 

[0020] Within the context of the present invention it should be understood that "biological samples" indude not only 
samples obtained from living organisms (e.g., mammals, fish, bacteria, parasites, viruses, fungi and the like) or from 
the environment (e.g.. air, water or solid samples), but biological materials which may be artificially or synthetically 

50 produced (e.g., phage libraries, organic molecule libraries, pools of genomic dones, cDNA dones. RNA dones, or the 
like). Representative examples of biological samples indude biological fluids (e.g.. blood, semen, cerebral spinal fluid, 
urine), biological cells (e.g.. stem cells, B or T cells, liver cells, fibroblasts and the like), and biological tissues. Finally, 
representative examples of organisms, that may be genotyped indude virtually any unicellular or multicellular organism, 
such as warm-blooded animals, mammals or vertebrates (e.g., humans, chimps, macaques, horses, cows, pigs, sheep, 

55 dogs, cats, rats and mice, as well as cells from any of these), bacteria, parasites, viruses, fungi and plants. 

[0021] Within various embodiments of the above-described methods, the nucleic add probes and or molecules of 
the present invention may be generated by, for example, a ligation, deavage or extension (e.g.. PGR) reaction. Within 
other related aspects the nucleic acid probes or molecules may be tagged by non-3' tagged oligonudeotide primers 
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(e.g., 5'-tagged oligonucleotide primers) or dideoxynucleotide terminators. 

[0022] Within other embodiments of the Invention. 4, 5, 10, 1 5. 20, 25, 30, 35, 40, 45, 50. 60, 70, 80, 90, 100. 200, 
250, 300, 350, 400, 450. or greater than 500 different and unique tagged molecules may be utilized within a given 
reaction simultaneously, wherein each tag Is unique for a selected nucleic acid molecule or fragment, or probe, and 

5 may be separately identified. 

[0023] .The tag(s) are detected massspeclrometry Representative examples Include time-of-flight mass spectrom- 
etry, quadrupole mass spectrometry, magnetic sector mass spectrometry and electric sector mass spectrometry. Spe- 
cific embodiments of such techniques include ion-trap mass spectrometry, electrospray ionization mass spectrometry, 
ion-spray mass spectrometry, liquid ionization mass spectrometry, atmospheric pressure ionization mass spectrometry, 

10 electron Ionization mass spectrometry, fast atom bombard ionization mass spectrometry, MALDI mass spectrometry, 
photo-ionization time-of-flight mass spectrometry, laser droplet mass spectrometry, MALDI-TOF mass spectrometry. 
APCI mass spectrometry, nano-spray mass spectrometry, nebulised spray ionization mass spectrometry, chemical 
ionization mass spectrometry, resonance ionization mass spectrometry, secondary ionization mass spectrometry and 
thermospray mass spectrometiy. 

15 [0024] Within yet other embodiments of the invention, the target molecules, hybridized tagged probes, unhybridized 
probes or target molecules, probertarget hybrids, or tagged nucleic acid probes or molecules may be separated from 
other molecules utilizing methods which discriminate between the size of molecules (either actual linear size, or three- 
dimensional size). Representative examples of such methods include gel electrophoresis, capillary electrophoresis, 
micro-channel electrophoresis. HPLC. size exclusion chromatography, filtration, polyacrylamide gel electrophoresis, 

20 liquid chromatography, reverse size exclusion chromatography, ion-exchange chromatography, reverse phase liquid 
chromatography, pulsed-field electrophoresis, field-inversion electrophoresis, dialysis, and fluorescence-activated liq- 
uid droplet sorting. Altematively. the target molecules, hybridized tagged probes, unhybridized probes or target mole- 
cules, probe:target hybrids, or tagged nucleic acid probes or molecules may be bound to a solid support (e.g., hollow . 
fibers (Amicon Corporation. Danvers, Mass.), beads (Polysciences. Wam'ngton, Pa.), magnetic beads (Robbin Scien-. 

25 tific, Mountain View. Calif.), plates, dishes and flasks (Coming Glass Works, Coming, N. Y.), meshes (Becton Dickinson, 
Mountain View. Calif.), screens and solid fibers (see Edelman et aL. U.S. Patent No. 3,843,324; see also Kuroda etyal., 
U.S. Patent No, 4.416,777), membranes (Millipore Corp., Bedford. Mass.). and dipsticks). If the first or second member, 
or exposed nucleic acids are bound to a solid support, within certain embodiments of the invention the methods dis- 
closed herein may further comprise the step of washing the solid support of unbound material. 

30 [0025] Within other embodiments, the tagged nucleic acid molecules or probes may be cleaved by a methods such 
as chemical, oxidation, reduction, acid-labile, base labile, enzymatic, electrochemical, heat and photolabile methods. 
Within further embodiments, the steps of separating, cleaving and detecting may be performed in a continuous manner, 
for example, on a single device which may be automated. 

[0026] Within certain embodiments of the invention, the size of the hybridized tagged probes, unhybridized probes 
35 or target molecules, or probe.target hybrids are altered by a method selected from the group consisting of polymerase 
extension, ligation, exonuclease digestion, endonudease digestion, restriction enzyme digestion, site-specific recom- 
btnase digestion, ligation, mismatch specific nuclease digestion, methylation-specific nuclease digestion, covalent at- 
tachment of probe to target and hybridization. 

[0027] The methods an compositions described herein may be utilized in a wide variety of applications, including for 
40 example, identifying PCR amplicons. RNA fingerprinting, differentia! display, single-strand conformation polymorphism 
detection, dideoxyfingerprinting, restriction maps and restriction fragment length polymorphisms. DNA fingerprinting, 
genotyping, mutation detection, oligonucleotide ligation assay, sequence specific amplifications, for diagnostics, fo- 
rensics, identification, developmental biology, biology, nfiolecular medicine, toxicology, animal breeding, 
[0028] These and other aspects of the present Invention will become evident upon reference to the following detailed 
<s description and attached drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0029] 

50 

Figure 1 depicts the flowchart for the synthesis of pentafluorophenyl esters of chemically deavable mass spec- 
troscopy tags, to liberate tags with carboxyl amide tenmlni. 

Figure 2 depicts the flowchart for the synfliesis of pentafluorophenyl esters of chemically deavable mass spec- 
troscopy tags, to liberate tags with cartx)xyl acid termini. 
55 Figures 3-6 and 8 depict the flowchart for the synthesis of tetrafluorophenyl esters of a set of 36 photochemically 

deavable mass spectroscopy tags. 

Figure 7 depicts the flowchart for the synthesis of a set of 36 amine-terminated photochemically deavable mass 
spectroscopy tags. 
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Figure 9 depicts the synthesis of 36 photochemically deavable mass spectroscopy tagged oligonucleotides made 
from the corresponding set of 36 tetrafluorophenyl esters of photochemically deavable mass spectroscopy tag 
acids. 

Figure 1 0 depicts the synthesis of 36 photochemically deavable mass spectroscopy tagged oligonudeotides made 
5 from the corresponding set of 36 amine-terminated photochemically deavable mass spectroscopy tags. 

- - _ _ -Figure 11 illustrates the.simuUanepuj detedion of rnultipje tags by mass spectrometry. 

Figure 12 shov\^s the mass spectrogram of the alpha-cyano matrix alone. ~ 

Figure 1 3 depicts a modularly-constructed tagged nucleic acid fragment. 

Figures 14A-14I show the separation of DNA fragments by HPLC using a variety of different buffer solutions. 
10 Figure 1 5 is a schematic representation of genetic fingerprinting and differential display systems in accordance 

with an exemplary embodiment of the present invention. 

Figure 16 is a schematic representation of genetic fingerprinting and differential display systems in accordance 
with an exemplary embodiment of the present invention. 

Figure 17 is a schematic representation of assay systems In accordance with an exemplary embodiment of the 
15 present invention. 

Figure 18 is a schematic representation of assay systems in accordance with an exemplary embodiment of the 
present invention. 

Figures 19A and 19B illustrate the preparation of a deavable tag of the present invention. 
Figures 20A and 20B illustrate the preparation of a deavable tag of the present Invention. 
20 Figure 21 illustrates the preparation of an intermediate compound useful in the preparation of a deavable tag of 

the invention. 

Figures 22A, 22B and 22C illustrate synthetic methodology for preparing a photodeavable mass spectrometry- 
detectable tag according to the present invention. 

Figure 23 shows the results from a an assay which monitored gene expression with CMST-Tagged ODNs. 
25 Figures 24-28 illustrate phosphoramidite chemistry more completely described in an Example herein. 

DETAILED DESCRIPTION OF THE INVENTION 

[0030] As noted above, the present invention provides compositions and methods for analyzing nucleic acid mole- 
30 cules, wherein separation of nudeic acid molecules based on size is required. The present methods permit the simul- 
taneous detection of molecules of interest, which indude nudeic adds and fragments, proteins, peptides, etc. 
[0031 ] The present invention provides a new dass of tags for genomics measurements that provide an assay platform 
compatible with the scale of measurements required to analyse complex genomes. This new tagging technology is 
composed of mass spectrometry tags that are detected with a standard quadrapole mass spectrometer detector (MSD) 
35 using atmosphiBric pressure chemical ionization (positive mode). The technology platform uses a MSD for detection 
of known molecular weight mass spectrometer tags much like a diode-an^y detector. The tags may be synthesized 
by combinatorial chemistry approaches using a primary scaffold upon which spedfic mass adjusters are appended. 
The tags are designed to be reversibly attached to oligonucleotides which can be employed either as primers in the 
PGR setting or used as probes in hybridization assays. At the completion of any number of assay steps, the tag/probe 
40 or tag/primer Is subject to a deavage reaction, preferably photodeavage, and when the tags are mass spectrometry- 
detectable, the tags are ionized by APCI and the mass identity of the tag is detemiined by mass spectrometry. The 
tags may be used to map the Identity of a sequence and sample Identification. 

[0032] Briefly stated, In one aspect the present invention provides compounds wherein a molecule of Interest, or 
precursor thereto, is linked via a labile bond (or labile bonds) to a tag. Thus, compounds of the invention may be viewed 
45 as having the general formula: 

T^-L-X 

50 wherein T is the tag component. L is the linker component that either is. or contains, a labile bond, and X is either the 
molecule of interest (MOI) component or a functional group component (L^) through which the MCI may be joined to 
T-L. Compounds of the invention may therefore be represented by the more spedfic general formulas: 

55 T-L-MOl 

and 
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T-L-L, 

[0033] For reasons described in detail below, sets of T-L-MOl compounds may be purposely subjected to conditions 
5 that cause the labile bond(s) to break, thus releasing a tag moiety from the remainder of the compound. The tag moiety 
- — is then characterized.byone.or.more.analyticaLtechnique^, to Uiereby^ provide direct mformation about the structure 
of the tag moiety, and (most importantly) indirect information about the identity of the corresponding MOI. ~ 
[0034] As a simple illustrative example of a representative compound of the invention wherein L is a direct bond, 
reference is made to the following structure (i): 

10 



Structure (i) q 

15 




(Nucleic AckJ Fragment) 



20 1! 1 J Lmker (L) component 



25 Tag conponent Molecule oflnierest 

component 



In structure (i), T is a nitrogen-containing polycydic aromatic moiety bonded to a carbonyl group. X is a MOI (and 
30 specifically a nucleic acid fragment terminating in an amine group), and L is the bond which forms an amide group. 
The amide bond is labile relative to the bonds in T because, as recognized in the art. an amide bond may be chemically 
cleaved (broken) by acid or base conditions which leave the bonds within the tag component unchanged. Thus, a tag 
moiety (i.e., the cleavage product that contains T) may be released as shown below: 

35 



40 



45 



50 



55 
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Stmcture (0 




(Nucfejc Acid Fragrrenl) 



10 



15 



20 



acid or base 



OH HjN- 



.(Nucleic Acid Fragment) 



Tag Moiecy 



Reiminder of the CoiTfK)und 



25 



30 



[0035] However, the linker L may be more than merely a direct bond, as shown in the following illustrative example, 
where reference is made to another representative compound of the invention having the structure (ii) shown below: 



35 



40 



Structure (ii) 






NO, 











L 



1 



^(Nuclek: Acid 
Fragnient) 



MOl 



45 



50 



It is well-known that compounds having an orf/io-nitrobenzylamine moiety (see boxed atoms within structure (ii)) are 
photolytically unstable, in that exposure of such compounds to actinic radiation of a specified wavelength will cause 
selective cleavage of the benzylamine bond (see bond denoted with heavy line in structure (ii)). Thus, structure (ii) has 
the same T and MOl groups as structure (i), however the linker group contains multiple atoms and bonds within which 
there is a particularly labile bond. Photolysis of structure (ii) thus releases a tag moiety (T-containing moiety) from the 
remainder of the compound, as shown below. 



55 
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Smicture (ii) 




10 



15 



20 



(Mucleic Acid 
Fragment) 



/(Nucleic acid 
^ Fragment) 
H 



Tag Moiety 



Remainder of the Conrpound 



25 



30 



[0036] The invention thus provides compounds which, upon exposure to appropriate cleavage conditions, undergo 
a deavage reaction so as to release a tag moiety from the remainder of the compound. Compounds of the invention 
may be described in terms of the tag moiety, the MOI (or precursor thereto, L^), and the labile bond(s) which join the 
two groups together. Alternatively, the compounds of the invention may be described in terms of the components from 
which they are formed. Thus, the compounds may be described as the reaction product of a tag reactant. a linker 
reactant and a MOI reactant, as follows. 

[0037] The tag reactant consists of a chemical handle (T^) and a variable component (Ty^)* so that the tag reactant 
is seen to have the general structure: 



35 



40 



[0038] To illustrate this nomenclature, reference may be made to structure (iit), which shows a tag reactant that may 
be used to prepare the compound of structure (ii). The tag reactant having structure (iii) contains a tag variable com- 
ponent and a tag handle, as shown below: 



Structure (iii) 



45 



50 



55 




Tag Variable 
Conponent 



Tag 
Handle 
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[0039] In structure (ili), the tag handle (-C(=0)-A) simply provides an avenue for reacting the tag reactant with the 
iinker reactant to form a T-L moiety. The group "A" in structure (iii) indicates that the carboxyl group is in a chemically 
active state, so it is ready for coupling with other handles. "A" may be. for example, a hydroxyl group or pentafluoroph- 
enoxy, among many other possibilities. The invention provides for a large number of possible tag handles which may 

5 be bonded to a tag variable component, as discussed in detail below. The tag variable component is thus a part of "T* 
-in-the formula_T-L-X. andjwill also be part of the_tag moiety that forms from the reaction that cleaves L. 
[0040] As also discussed in detail below, the tag variable component is so-named because, in preparing setis of 
compounds according to the invention, it is desired that members of a set have unique variable components, so that 
the individual members may be distinguished from one another by an analytical technique. As one example, the tag 

10 variable component of structure (iii) may be one member of the following set, where members of the set may be 
distinguished by their UV or mass spectra: 



15 



20 




[0041] Likewise, the linker reactant may be described in terms of its chemical handles (there are necessarily at least 
two, each of which may be designated as L^) which flank a linker labile component, where the linker labile component 
25 consists of the required labile moiety (L^) and optional labile moieties (L^ and L^), where the optional labile moieties 
effectively serve to separate L2 from the handles Lj,. and the required labile moiety serves to provide a labile bond 
within the linker labile component. Thus, the linker reactant may be seen to have the general formula: 

[0042] The nomenclature used to describe the linker reactant may be illustrated in view of stnjcture (iv), which again 
draws from the compound of structure (ii): 



35 

Structure (iv) 




Linker 
Handle 



50 [0043] As structure (iv) illustrates, atoms may serve in more than one functional role. Thus, in structure (iv). the 
benzyl nitrogen functions as a chemical handle in allowing the linker reactant to join to the tag reactant via an amide- 
forming reaction, and subsequently also serves as a necessary part of the structure of the labile moiety \? in that the 
benzylic carbon-nitrogen bond Is particulariy susceptible to photolytic deavage. Structure (Iv) also illustrates that a 
linker reactant may have an group (in this case, a methylene group), although not have an U group. Likewise, linker 

55 reactants may have an U group but not an group, or may have and groups, or may have neither of nor 
groups. In structure (iv), the presence of the group "P" next to the carbonyl group indicates that the carbonyl group is 
protected from reaction. Given this configuration, the activated cartjoxyl group of the tag reactant (iii) may cleanly react 
with the amine group of the linker reactant (iv) to form an amide bond and give a compound of the formula T-L-L^. 
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[0044] The MOI reactant is a suitably reactive form of a molecule of interest. Where the molecule of interest is a 
nucleic acid fragment, a suitable MOI reactant is a nucleic acid fragment bonded through its 5' hydroxyl group to a 
phosphodiester group and then to an alkylene chain that terminates in an amino group. This amino group may then 
react with the carbonyl group of structure (iv). (after, of course, deprotecting the carbonyl group, and preferably after 

5 subsequently activating the carbonyl group toward reaction with the amine group) to thereby join the MOI to the linker. 

- _ .I0045]_ When viewed jn _a chrpnological^order, the invention Is seen to lake a tag reactant (having a chemical tag 
handle and a tag variable component), a linker reactant (having two chemicaflinkeT handles, a required labile moiety ~ 
and 0-2 optional labile moieties) and a MOI reactant (having a molecule of interest component and a chemical molecule 
of interest handle) to form T-L-MOI. Thus, to form T-L-MOI. either the tag reactant and the linker reactant are first 

10 reacted together to provide T-L-L^. and then the MOI reactant Is reacted with T-L-L^ so as to provide T-L-MOI. or else 
(less preferably) the linker reactant and the MOI reactant are reacted together first to provide Lt,-L-MOK and then l^- 
L-MOl is reacted with the tag reactant to provide T-L-MOl, For purposes of convenience, compounds having the formula 
T-L-MOI will be described in terms of the tag reactant. the linker reactant and the MOI reactant which may be used to 
form such compounds. Of course, the same compounds of formula T-L-MOl could be prepared by other (typically. 

15 more laborious) methods, and still fall within the scope of the inventive T-L-MOI compounds. 

[0046] In any event, the invention provides that a T-L-MOI compound be subjected to cleavage conditions, such that 
a tag moiety is released from the remainder of the compound. The tag moiety will comprise at least the tag variable 
component, and will typically additionally comprise some or all of the atoms from the tag handle, some or all of the 
atoms from the linker handle that was used to join the tag reactant to the linker reactant, the optional labile moiety L^ 

20 If this group was present in T-L-MOI. and will perhaps contain some part of the required labile moiety L^ depending on 
the predse structure of L2 and the nature of the cleavage chemistry. For convenience, the tag moiety may be referred 
to as the T-containing moiety because T will typically constitute the major portion (in terms of mass) of the tag moiety. 
[0047] Given this introduction to one aspect of the present invention, the various components T, L and X will be 
described in detail. This description begins with the following definitions of certain temis. which will be used hereinafter 

25 in describing T. L and X. 

[0048] As used herein, the term "nucleic acid fragment" means a molecule which is complementary to a selected 
target nucleic acid molecule {i.e., complementary to all or a portion thereof), and may be derived from nature or syn- 
thetically or recombinantly produced, including non-naturally occurring molecules, and may be in double or single 
stranded form where appropriate; and includes an oligonucleotide (e.g., DNA or RNA), a primer, a probe, a nudeic 

30 acid analog (e.g., PNA), an oligonudeotide which is extended in a 5' to 3' direction by a polymerase, a nucleic acid 
which is cleaved chemically or enzymatically, a nudeic acid that is terminated with a dideoxy terminator or capped at 
the 3' or 5' end with a compound that prevents polymerization at the 5' or 3' end, and comlsinations thereof. The 
complementarity of a nucleic acid fragment to a selected target nudeic acid molecule generally means the exhibition 
of at least about 70% specific base pairing throughout the length of the fragment. Preferably the nudeic acid fragment 

35 exhibits at least about 80% specific base pairing; and most preferably at least atx)ut 90%. Assays for determining the 
percent mismatch (and thus the percent specific base pairing) are well known in the art and are based upon the percent 
mismatch as a function of the Tm when referenced to the fully base paired control. 

[0049] As used herein, the term "alkyl." atone or in combination, refers to a saturated, straight-chain or branched- 
chain hydrocarbon radical containing from 1 to 10, preferably from 1 to 6 and more preferably from 1 to 4, carbon 
40 atoms. Examples of such radicals indude. but are not limited to, methyl, ethyl, n-propyl, iso-propyl, n>butyl, iso-butyl. 
sec-butyl, tert-butyl, pentyl, iso-amyl, hexyl, decyl and the like. The term "alkylene" refers to a saturated, straight-chain 
or branched chain hydrocarbon diradical containing from 1 to 10, preferably from 1 to 6 and more preferably from 1 to 
4. cartx)n atoms. Examples of such diradicals include, but are not limited to. methylene, ethylene (•CH2-CH2-), propyl- 
ene, and the like. 

45 [0050] The term "alkenyl," alone or in combination, refers to a straight-chain or branched-chain hydrocarbon radical 
having at least one carbon-carbon double bond in a total of from 2 to 1 0, preferably from 2 to 6 and more preferably 
from 2 to 4, carbon atoms. Examples of such radicals indude, but are not limited to, ethenyl. E- and Z-propenyl. iso- 
propenyl, E- and Z-butenyl, E- and Z-isobutenyl. E- and Z-pentenyt, decenyl and the like. The term "alkenylene" refers 
to a straight-chain or branched-chain hydrocarbon diradical having at least one cart)on-cariK)n double bond in a total 

50 of from 2 to 10, preferably from 2 to 6 and more preferably from 2 to 4. carbon atoms. Examples of such diradicals 
include, but are not limited to, methylidene (=CH2), ethylidene (-CH=CH-). propylidene (-CH2-CH=CH-) and the like. 
[0051] The term "alkynyl." alone or in combination, refers to a straight-chain or branched-chain hydrocarbon radical 
having at least one cartx>n-carton triple bond in a total of from 2 to 10. preferably from 2 to 6 and more preferably from 
2 to 4, carbon atoms. Examples of such radicals indude. but are not limited to, ethynyl (acetylenyl), propynyl (propargyl). 

55 butynyl. hexynyl. decynyl and the like. The term "alkynylene", alone or in combinatton, refers to a straight-chain or 
branched-chain hydrocart)on diradical having at least one carbon-carbon triple bond in a total of from 2 to 1 0, preferably 
from 2 to 6 and more preferably from 2 to 4, carbon atoms. Examples of such radicals include, but are not limited, 
ethynylene (-C=C-), propynylene (-CH2-CSC-) and the like. 
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[0052] The term "cydoalkyi," alone or in combination, refers to a saturated, cyclic an^ngement of carbon atoms 
which number from 3 to 8 and preferably from 3 to 6, carbon atoms. Examples of such cydoalkyi radicals include, but 
are not limited to. cyclopropyl. cydobutyl, cyclopentyl, cydohexyl and the like. The term "cydoalkylene" refers to a 
diradical fonn of a cydoalkyi. 

5 [0053] The term "cydoalkenyl," alone or in combination, refers to a cydic carbocyde containing from 4 to 8, preferably 
- - 5 or 6, carbon atoms and one oripoje double bonds. Examples of such cycloalkenyl radicals indude. but are not limited 
to, cyclopentenyl, cydohexenyl, cydopentadienyl and the like. The term "cydoalkenylehe" refersloli diradical fdrm of " 
a cydoalkenyL 

[0054] The term "aryl" refers to a cartjocyclic (consisting entirely of carbon and hydrogen) aromatic group selected 
10 from the group consisting of phenyl, naphthyl, indenyl, indanyl, azulenyl. fluorenyl. and anthracenyl; or a heterocydic 
aromatic group selected from the group consisting of furyl, thienyf. pyridyl, pyrrolyl. oxazolyly, thiazolyl, imidazolyl. 
pyrazolyl, 2-pyrazolinyl. pyrazolidinyl, isoxazolyl, isothiazolyl, 1. 2, 3-oxadiazolyl, 1. 2. 3-triazolyl. 1. 3, 4-thiadiazolyl, 
pyridazinyl, pyrimidinyl, pyrazinyl, 1,3, 5-triazinyl, 1. 3, 5-trithianyl, indolizinyl, indolyl, isoindoiyi, 3H-indolyl. indolinyl, 
benzo(blfuranyl, 2, 3-dihydrobenzofuranyl, benzo[b]thiophenyl, 1H-indazolyl, benzimidazolyl, benzthiazolyl, purinyl, 
15 4H-quinolizinyl. quinolinyl, isoquinolinyl, dnnolinyl, phthalazinyl, quinazolinyl, quinoxalinyl, 1, 8-naphthyridinyl. pterid- 
inyl. carbazolyl, acridinyl. phenazinyt. phenothiazinyl. and phenoxazinyl. 

[0055] "Aryl" groups, as defined in this application may independently contain one to four substituents which are 
independently selected from the group consisting of hydrogen, halogen, hydroxyl, amino, nitro, trifluoromethyl, trifluor- 
omethoxy, alkyi, alkenyl, alkynyl. cyano, cartx)xy, carboalkoxy, 1 .2-dioxyethylene, aikoxy, alkenoxy or alkynoxy, 

20 alkylamino. alkenylamino. alkynylamino. aliphatic or aromatic acyl. alkoxy-carbonylamino, alkylsulfonylamino, mor- 
pholinocarbonylamino. thiomorpholinocarbonylamino, N-alkyI guanldino, aralkylaminosulfonyl; aralkoxyalkyl; N-ar- 
alkoxyurea; N-hydroxylurea; N-alkenyturea; N,N-(alkyl, hydroxyl )urea; heterocydyl; thioaryloxy-substituted aryl; N, 
N-(aryl. alkyl)hydrazino; Ar'-substituted sulfonylheterocydyl; aralkyl-substituted heterocydyl; cydoalkyi and cydoake- 
nyl-substituted heterocydyl; cydoalkyl-fused aryl; aryioxy-substituted alkyI; heterocyclylamino; aliphatic or aromatic 

25 acylaminocarbonyl; aliphatic or aromatic acyl-substituted alkenyl; Ar'-substituted aminocarbonyloxy; Ar*, Ar'-disubsti- 
tuted aryl; aliphatic or aromatic acyl-substituted acyl: cydoalkylcarbonylalkyl; cydoalkyl-substituted amino; aryloxycar- 
bonylalkyl; phosphorodiamidyl acid or ester; 

[0056] "Ar*" is a carbocydic or heterocyclic aryl group as defined above having one to three substituents selected 
from the group consisting of hydrogen, halogen, hydroxyl, amino, nitro, trifluoromethyl, trifluoromethoxy, alkyl, alkenyl. 
30 alkynyl, 1 ,2-dioxymethylene, 1 ,2-dioxyethylene, aikoxy. alkenoxy, alkynoxy. alkylamino. alkenylamino or alkynylamino. 
alkylcarbonyloxy, aliphatic or aromatic acyl. alkylcarbonylamino. alkoxycarbqnylamino. alkylsulfonylamino. N-alkyI or 
N.N-dialkyI urea. 

[0057] The term "aikoxy." alone or in combination, refers to an alkyl ether radical, wherein the tenn "alkyl" is as 
defined above. Examples of suitable alkyl ether radicals include, but are not limited to. methoxy, ethoxy, n-propoxy, 
35 iso-propoxy. n-butoxy. iso-butoxy. sec-butoxy. tert-butoxy and the like. 

[0058] The terni "alkenoxy," alone or in combination, refers to a radical of formula alkenyl-O-, wherein the term 
"alkenyl" is as defined above provided that the radical is not an enol ether. Examples of suitable alkenoxy radicals 
indude, but are not limited to, allyloxy. E- and Z-3-methyl-2-propenoxy and the like. 

[0059] The term "alkynyloxy," alone or in combination, refers to a radical of formula alkyny1-0-, wherein the term 
40 "alkynyl" is as defined above provided that the radical is not an ynol ether. Examples of suitable alkynoxy radicals 
Include, but are not limited to, propargyloxy, 2-butynyloxy and the like. 

[0060] The tenn "thioalkoxy" refers to a thioether radical of formula alkyl-S-, wherein alkyl is as defined above. 
[0061] The'temi "alkylamino." alone or in combination, refers to a mono- or di-alkyl-substituted amino radical (Le.. a 
radical of fomiula alkyl-NH- or (alkyOj-N-), wherein the term "alkyl" is as defined above. Examples of suitable alkylamino 
45 radicals include, but are not limited to, methylamino, ethylamino. propylamine, isopropylamino. t-butyiamino, N.N-di- 
ethylamino and the like. 

[0062] The term "alkenylamino," alone or in combination, refers to a radical of fomiula alkenyl-NH- or (alkenyl)2N-, 
wherein the temi "alkenyl" is as defined above, provided that the radical is not an enamine. An example of such alke- 
nylamino radicals is the allylamino radical. 
50 [0063] The term "alkynylamino," alone or in combination, refers to a radical of formula alkynyl-NH- or {alkynyl)2N-, 
wherein the tenn "alkynyl" is as defined above, provided that the radical Is not an ynamine. An example of such alky- 
nylamino radicals is the propargyl amino radical. 

[0064] The tenn "amide" refers to either -N(R^)-C(=6)- or -C(=0)-N(Ri)-where is defined herein to indude hy- 
drogen as well as other groups. The term "substituted amide" refers to the situation where R^ is not hydrogen, while 
55 the term "unsubstituted amide" refers to the situation where R^ is hydrogen, 

[0065] The term "aryloxy," alone or in combination, refers to a radical of formula aryl-0-, wherein aryl is as defined 
above. Examples of aryloxy radicals include, but are not limited to, phenoxy. naphthoxy. pyridyloxy and the like. 
[0066] The tenm "arylamino," alone or in combination, refers to a radical of formula aryl-NH-. wherein aryl is as defined 
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above. Examples of arylamino radicals include, but are not limited to. phenylamino (ani(ido), naphthylamino, 2-, 3- and 
4-pyridylamino and the like. 

[0067] The term "aryl-fused cydoalkyi," alone or in combination, refers to a cycloalkyi radical which shares two ad- 
jacent atoms with an aryl radical, wherein the terms "cycloalkyi" and "aryl" are as defined above. An example of an 

5 aryl-fused cydoaikyl radical is the benzofused cyclobutyl radical. 

[0068] _The teip "ajkylcarbonylaminq," alone or in combination, refere to _a radical of formula alkyl-CONH, wherein 
the term "alkyl" is as defined above. ~ 
[0069] The tenm "alkoxycart)onylamino," alone or in combination, refers to a radical of formula alkyl-OCONH-, wherein 
-the term "alkyl" Is as defined above. 

10 [0070] The term "alkylsulfonylamino," alone or in combination, refers to a radical of formula alkyt-S02NH-. wherein 
the term "alkyl" is as defined above. 

[0071] The term "arylsulfonylamino," alone or in combination, refers to a radical of formula aryl-S02NH-, wherein 
the tenn "aryl" is as defined above. 

[0072] The temi "N-alkylurea," alone or in combination, refers to a radical of formula alkyl-NH-CO-NH-, wherein the 
15 term "alkyl" is as defined above. 

[0073] The term "N-arylurea," alone or in combination, refers to a radical of formula aryl-NH-CO-NH-, wherein the 
temi "aryl" is as defined above. 

[0074] The term "halogen" means fluorine, chlorine, bromine and iodine. 

[0075] The term "hydrocarbon radical" refers to an arrangement of carbon and hydrogen atoms which need only a 
20 single hydrogen atom to be an independent stable molecule. Thus, a hydrocarbon radical has one open valence site 
on a cartK>n atom, through which the hydrocarbon radical may be bonded to other atom(s). Alkyl. aikenyl, cycloalkyi, 

etc. are examples of hydrocarbon radicals. 

[0076] The term "hydrocarbon diradical" refers to an arrangement of carbon and hydrogen atoms which need two 
hydrogen atoms in order to be an independent stable molecule. Thus, a hydrocarbon radical has two open valence 
25 sites on one or two carbon atoms, through which the hydrocarbon radical may be bonded to other atom(s). Alkylene, 
alkenylene. alkynylene, cycloalkylene. etc. are examples of hydrocarbon diradicals. 

[0077] The term "hydrocarbyl" refers to any stable arrangement consisting entirely of carbon and hydrogen having 
a single valence site to which it is bonded to another moiety, and thus includes radicals known as alkyl, aikenyl, alkynyl, 
cycloalkyi, cycloalkenyl, aryl (without heteroatom incorporation into the aryl ring), arylalkyl. alkylaryl and the like. Hy- 

30 drocarbon radical is another name for hydrocarbyl. 

[0078] The term "hydrocarbylene" refers to any stable an^ngement consisting entirely of carbon and hydrogen having 
two valence sites to which it is bonded to other moieties, and thus includes alkylene, alkenylene, alkynylene, cy- 
cloalkylene, cycloalkenylene, arylene (without heteroatom incorporation into the arylene ring), arylalkylene, alky- 
larylene and the like. Hydrocarbon diradical is another name for hydrocarbylene. 

35 [0079] The term "hydrocarbyi-O-hydrocarbylene" refers to a hydrocarbyl group bonded to an oxygen atom, where 
the oxygen atom is likewise bonded to a hydrocarbylene group at one of the two valence sites at which the hydrocar- 
bylene group is bonded to other moieties. The temns "hydrocarbyl-S-hydrocarbylene", "hydrocarbyl-NH-hydrocarb- 
yiene" and "hydrocarbyl-amide-hydrocarbylene" have equivalent meanings, where oxygen has been replaced with 
sulfur, -NH- or an amide group, respectively. 

40 [0080] The term N-(hydrocarbyl)hydrocarbylene refers to a hydrocarbylene group wherein one of the two valence 
sites is bonded to a nitrogen atom, and that nitrogen atom Is simultaneously bonded to a hydrogen and a hydrocarbyl 
group. The temi N.N-di(hydrocarbyl)hydrocarbylene refers to a hydrocarbylene group wherein one of the two valence 
sites Is bonded to a nitrogen atom, and that nitrogen atom is simultaneously bonded to two hydrocarbyl groups. 
[0081] The term "hydrocarbylacyl-hydrocarbylene" refers to a hydrocarbyl group bonded through an acyl (-C(=0)-) 

45 group to one of the two valence sites of a hydrocarbylene group. 

[0082] The terms "heterocydyl hydrocarbyl" and "heterocylyl" refer to stable, cyclic arrangement of atoms which in- 
clude carbon atoms and up to four atoms (refen-ed to as heteroatoms) selected from oxygen, nitrogen, phosphorus 
and sulfur. The cyclic arrangement may be in the form of a monocyclic ring of 3-7 atoms, or a bicyclic ring of 8-11 
atoms. The rings may be saturated or unsaturated (including aromatic rings), and may optionally be benzofused. Ni- 

50 trogen and sulfur atoms in the ring may be in any oxidized form, including the quatemized form of nitrogen. A hetero- 
cyclylhydrocarbyl may be attached at any endocyclic carbon or heteroatom which results in the creation of a stable 
structure. Prefen-ed heterocydylhydrocarbyls indude 5-7 membered monocydic heterocydes containing one or two 
nitrogen heteroatoms. 

[0083] A substituted heterocydyl hydrocarbyl refers to a heterocydylhydrocarbyl as defined above, wherein at least 
55 one ring atom thereof is bonded to an indicated substituent which extends off of the ring. 

[0084] In referring to hydrocarbyl and hydrocarbylene groups, the term "derivatives of any of the foregoing wherein 
one or more hydrogens is replaced with an equal number of fluorides" refers to molecules that contain carbon, hydrogen 
and fluoride atoms, but no other atoms. 
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[0085] The term "activated ester" is an ester that contains a "leaving group" which is readily displaceable by a nu- 
cleophile, such as an amine, and alcohol or a thiol nucleophile. Such leaving groups are well known and include, without 
limitation, N-hydroxysuccinimide, N-hydroxybenzotriazole, halogen (halides), alkoxy including tetrafluorophenolates, 
thioalkoxy and the like. The term "protected ester" refers to an ester group that is masked or othenA^ise unreactive. 
5 See, e.g.. Greene, "Protecting Groups In Organic Synthesis." 

[0086] In view of the above definitions, other chemical terms used throughout this application can be easily under- 
stood by those of skill in The aTt. Terms may be used alone or in any combination thereof. The preferrecl and "more " 
preferred chain lengths of the radicals apply to all such combinations. 

10 A. GENERATION OF TAGGED NUCLEIC ACID FRAGMENTS 

[0087] As noted above, one aspect of the present invention provides a general scheme for DNA sequencing which 
allows the use of more than 1 6 tags in each lane: with continuous detection, the tags can be detected and the sequence 
read as the size separation is occurring, just as with conventional fluorescence-based sequencing. This scheme is 
15 applicable to any of the DNA sequencing techniques based on size separation of tagged molecules. Suitable tags and 
linkers for use within the present invention, as well as methods for sequencing nucleic acids, are discussed in more 
detail below. 

1 . Tags 

20 

[0088] "Tag", as used herein, generally refers to a chemical moiety which is used to uniquely identify a "molecule of 
interest", and more specifically refers to the tag variable component as well as whatever may be bonded most closely 
to it in any of the tag reactant, tag component and tag moiety. Thus, the tagged molcule, upon cleavage, generates 
essentially a single cleavage product, which is the tag to be analyzed. 
25 [0089] A tag which is useful in the present invention possesses several attributes: 

1 ) It is capable of being distinguished from all other tags. This discrimination from other chemical moieties is based 
on its spectroscopic properties, viz. mass spectroscopy. 

2) The tag is capable of being detected when present at 10-22 {q mole, 

30 3) The tag possesses a chemical handle through which it can be attached to the MOI which the tag is intended to 

uniquely identify. The attachment may be made directly to the MOI, or indirectly through a "linker" group. 

4) The tag Is chemically stable toward all manipulations to which it is subjected, including attachment and cleavage 
from the MOI. and any manipulations of the MOI while the tag is attached to it. 

5) The tag does not significantly interfere with the manipulations performed on the MOI while the tag is attached 
35 to it. For instance, if the tag is attached to an oligonucleotide, the tag must not significantly interfere with any 

hybridization or enzymatic reactions (e.g., PGR sequencing reactions) performed on the oligonucleotide. Similarly, 
if the tag is attached to an antibody, it must not significantly interfere with antigen recognition by the antibody, 

[0090] A tag moiety which is intended to be detected by a certain spectroscopic or px)tentiometric method should 
40 possess properties which enhance the sensitivity and specificity of detection by that method. Typically, the tag moiety 
will have those properties because they have been designed into the tag variable component, which will typically 
constitute the major portion of the tag moiety. In the following discussion, the use of the word "tag" typically refers to 
the tag moiety (/.e., the cleavage product that contains the tag variable component), however can also be considered 
to refer to the tag variable component itself because that is the portion of the tag moiety which is typically responsible 
45 for providing the uniquely detectable properties. In compounds of the formula T-L-X, the "T" portion will contain the tag 
variable component to be characterized by, mass spectrometry; the "T" portion of T-L-X is therefore refenred to as T^^, 
Likewise, the cleavage product from T-L-X that contains T may be refenred to as the T^-containing moiety. The fol- 
lowing spectroscopic and potentiometric methods may be used to characterize 7^^-containing moieties. 

50 Characteristics of MS Tags 

[0091] The tag is analyzable by mass spectrometry (i.e., is a MS-readable tag, also referred to herein as a MS tag 
or "P"^-containing moiety"). The essential feature of the tag is that it is able to be ionized. It is thus a preferred element 
in the design of MS-readable tags to incorporate therein a chemical functionality which can carry a positive or negative 
55 charge under conditions of ioni2:ation in the MS. This feature confers improved efficiency of ion formation and greater 
overall sensitivity of detection, particularly in electrospray ionization. The chemical functionality that supports an ionized 
charge may derive from P"^ or L or both. Factors that can increase the relative sensitivity of an analyte being detected 
by mass spectrometry are discussed in, e.g., Sunner, J., et aL, Anal. Cbem. 60:1300-1307 (1988). 
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[0092] A preferred functionality to facilitate the carrying of a negative charge is an organic acid, such as phenolic 
hydroxy!, carboxyiic acid, phosphonate, phosphate, tetrazole, sulfonyl urea, perfluoro alcohol and sulfonic acid. 
[0093] Prefen-ed functionality to facilitate the carrying of a positive charge under ionization conditions are aliphatic 
or aromatic amines. Examples of amine functional groups which give enhanced detectability of MS tags include qua- 

5 ternary amines {i.e„ amines that have four bonds, each to carbon atoms, see Aebersold. U.S. Patent No. 5.240.859) 
and tertiary amines {i.e. , amines that have three bonds, each to carbon atoms, which includes C=N-C groups such as 
are present in pyridine? see Hess"erar. AnaCBiochem. '224:273, 199'5: Bures eTalT, 7\A7a/. B/oc/)em7 224:364,~1995). 
Hindered tertiary amines are particularly preferred. Tertiary and quatemary amines may be alkyl or aryl. A T^-con- 
taining moiety must bear at least one ionizable species, but may possess more than one ionizable species. The pre- 

10 ferred charge state is a single ionized species per tag. Accordingly, it is preferred that each T^^-containing moiety (and 
each tag variable component) contain only a single hindered amine or organic acid group. 
[0094] Suitable amine-containing radicals that may form part of the 'ps-containing moiety Include the following: 
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15 [0095] The identification of a tag by mass spectrometry Is preferably based upon its molecular mass to charge ratio 
(m/z). The preferred molecular mass range of MS tags is from about 100 to 2.000 daltons. and preferably the 
containing moiety has a mass of at least about 250 daltons, more preferably at least about 300 daltons, and still more 
preferably at least about 350 daltons. It is generally difficult for mass spectrometers to distinguish among moieties 
having parent ions below about 200-250 daltons (depending on the precise instrument), and thus prefered P^-con- 

20 taining moieties of the invention have masses above that range. 

[0096] As explained above, the T^s-containing moiety may contain atoms other than those present in the tag variable 
component, and indeed other than present in T^ itself. Accordingly, the mass of T^ itself may be less than about 250 
daltons, so long as the T»"s-containing moiety has a mass of at least about 250 daltons. Thus, the mass of may 
range from 15 (i.e., a methyl radical) to about 10.000 daltons, and preferably ranges from 100 to about 5,000 daltons, 

25 and more preferably ranges from about 200 to about 1 ,000 daltons. 

[0097] It is relatively difficult to distinguish tags by mass spectrometry when those tags incorporate atoms that have 
more than one isotope in significant abundance. Accordingly, prefen-ed T groups which are intended for mass spec- 
troscopic identification (T^^ groups), contain carbon, at least one of hydrogen and fluoride, and optional atoms selected 
from oxygen, nitrogen, sulfur, phosphorus and iodine. While other atoms may be present in the T^^ their presence 

30 can render analysis of the mass spectral data somewhat more difficult. Preferably, the P"® groups have only carbon, 
nitrogen and oxygen atoms, in addition to hydrogen and/or fluoride. 

[0098] Fluoride is an optional yet prefen-ed atom to have in a group. In comparison to hydrogen, fluoride is, of 
course, much heavier. Thus, the presence of fluoride atoms rather than hydrogen atoms leads to groups of higher 
mass, thereby allowing the V^^ group to reach and exceed a mass of greater than 250 daltons. which is desirable as 
35 explained above. In addition, the replacement of hydrogen with fluoride confers greater volatility on the T^-containing 
moiety, and greater volatility of the analyte enhances sensitivity when mass spelctrometry is being used as the detection 
method. 

[0099] The molecular fomiula of falls within the scope of Ci.5ooNo,iooOo-iooSo.ioPo.io^^af^p^5 wherein the sum 
of a, p and 5 is sufficient to satisfy the othenA^ise unsatisfied valencies of the C, N, O, S and P atoms. The designation 
Ci_5ooNo.iooOo-ioo%ioPo-io^a^p'6 ^eans that T^ contains at least one, and may contain any number ft-om I to 500 
carbon atoms, in addition to optionally containing as many as 100 nitrogen atoms ("Nq." means that T"« need not 
contain any nitrogen atoms), and as many as 100 oxygen atoms, and as many as 10 sulfur atoms and as many as 10 
phosphorus atoms. The symbols a. p and 5 represent the number of hydrogen, fluoride and iodide atoms in T™s, where 
any two of these numbers may be zero, and where the sum of these numbers equals the total of the otherwise unsatisfied 
45 valencies of the C. N. O. S and P atoms. Preferably. Tms has a molecular fonmula that falls within the scope of 
Ci ^No.i oOo-io^^af^p where the sum of a and p equals the number of hydrogen and fluoride atoms, respectively, present 
in the moiety. 

[0100] In addition to the above properties, it is preferred that the tag have a modular chemical structure. This aids 
in the construction of large numbers of structurally related tags using the techniques of combinatorial chemistry. For 

50 example, the T^^ group desirably has several properties. It desirably contains a functional group which supports a 
single ionized charge state when the T"^s^ntaining moiety is subjected to mass spectrometry (more simply referred 
to as a "mass spec sensitivity enhancer" group, or MSSE). Also, it desirably can serve as one member in a family of 
pns-containing moieties, where members of the family each have a different mass/charge ratio, however have approx- 
imately the same sensitivity in the mass spectrometer. Thus, the members of the family desirably have the same MSSE. 

55 In order to allow the creation of families of compounds, it has been found convenient to generate tag reactants via a 
modular synthesis scheme, so that the tag components themselves may be viewed as comprising modules. 
[0101] In a preferred modular approach to the structure of the group. has the fomiula 
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wherein is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, 

5 sulfur and phosphorus, having a mass range of 15 to 500 daltons: is an organic moiety formed from carbon and 
one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, sulfur and phosphorus, having a mass range of 50 to 1 000 
daltons; J is a direct borfd or a functional group such as amide, esterTamine.'sulfide, ettfer thioesteri^ disulfide, thioether, ' 
urea, thiourea, carbamate, thiocarbamate, Schiff base, reduced Schiff base, imine, oxime, hydrazone. phosphate, 
phosphonate, phosphoramide, phosphonamide. sulfonate, sulfonamide or carbon-cartjon bond; and n is an integer 

10 ranging from 1 to 50, such that when n is greater than 1, each V and J is independently selected. 

[0102] The modular stoicture T2-(J-T3)j,- provides a convenient entry to families of T-L-X compounds, where each 
member of the family has a different T group. For instance, when T is T^s and each family member desirably has the 
same MSSE, one of the groups can provide that MSSE structure. In order to provide variability between members 
of a family in terms of the mass of V^, the group may be varied among family members. For instance, one family 

15 member may have = methyl, while another has = ethyl, and another has = propyl, etc. 

[01031 In order to provide "gross" or large jumps In mass, a group may be designed which adds significant (e.g., 
one or several hundreds) of mass units to T-L-X. Such a group may be referred to as a molecular weight range 
adjuster groupCWRA") A WRA is quite useful if one is working with a single set of 1^ groups, which will have masses 
extending over a limited range. A single set of groups may be used to create P"^ groups having a wide range of 

20 mass simply by Incorporating one or more WRA groups into the T^s. Thus, using a simple example, if a set of T2 
groups affords a mass range of 250-340 daltons for the T^^^ the addition of a single WRA, having, as an exemplary 
number 100 dalton, as a T^ group provides access to the mass range of 350-440 daltons while using the same set of 
T2 groups, Similarly, the addition of two 100 dalton MWA groups (each as a T^ group) provides access to the mass 
range of 450-540 daltons, where this incremental addition of WRA groups can be continued to provide access to a 

25 very large mass range for the P"* group. Preferred compounds of the fomiula T^(J-T3-)n-L-X have the formula 
Rvwc-(Rwra)w-Rmsse-*--X where VWC is a "T^" group, and each of the WRA and MSSE groups are "T** groups. This 
structure is illustrated in Figure 12, and represents one modular approach to the preparation of V^, 
[0104] In the formula T2-(J-T3-)n-, T^ and T^ are preferably selected from hydrocarbyl, hydrocarbyl-O-hydrocarb- 
ylene. hydrocarbyl-S-hydrocarbylene, hydrocarbyl-NH-hydrocarbylene, hydrocarbyl-amide-hydrocarbylene. N-(hydro- 

30 carbyOhydrocarbylene, N.N-di(hydrocarbyl)hydrocarbylene, hydrocarbylacyl-hydrocarbylene, heterocyclylhydrocarbyl 
wherein the heteroatom(s) are selected firom oxygen, nitrogen, sulfur and phosphoms, substituted heterocyclylhydro- 
carbyl wherein the heteroatom(s) are selected from oxygen, nitrogen, sulfur and phosphorus and the substituents are 
selected from hydrocarbyl, hydrocarbyl -O-hydrocarbylene, hydrocarbyl-NH-hydrocarbylene, hydrocarbyl-S-hydrocar- 
bylene, N-(hydrocarbyl)hydrocarbylene, N,N-di(hydrocarbyl)hydrocarbylene and hydrocarbylacyl-hydrocarbylene. In 

35 addition. T2 and/or T^ may be a derivative of any of the previously listed potential T^ / T^ groups, such that one or more 
hydrogens are replaced fluorides. 

[0105] Also regarding the formula T2-(J-T3.)n-, a prefen-ed T^ has the formula -G(R2)-. wherein G is C^^ alkylene 
chain having a single R2 substituent. Thus, if G is ethylene (-CH2-CH2-) either one of the two ethylene cartjons may 
have a R2 substituent, and R2 is selected from alkyl, alkenyl, alkynyl. cycloalkyl. aryl-fused cydoalkyl, cycloalkenyl, 

40 aryl, aralkyi, aryl-substituted alkenyl or alkynyl, cycloalkyl-substituted alkyl. cydoalkenyl-substituted cycloalkyl, biaryl, 
alkoxy, alkenoxy, alkynoxy, aralkoxy, aryl-substituted alkenoxy or alkynoxy. alkylamino, alkenylamino or alkynylamino, 
aryl-substituted alkylamino. aryl-substituted alkenylamino or alkynylamino, aryloxy, arylamino, N-alkylurea-substituted 
alkyl, N-arylurea-substituted alkyl, alkylcarbonylamino-substituted alkyl, aminocartsonyt-substltuted alkyl. heterocyclyl, 
heterocyclyl-substituted alkyl, heterocyclyl-substituted amino, carboxyalkyi substituted aralkyi, oxocarbocyclyl-fused 

45 aryl and heterocyclylalkyi; cycloalkenyl. aryl-substituted alkyl and, aralkyi, hydroxy-substituted alkyl, alkoxy-substituted 
alkyl, aralkoxy-substituted alkyl. alkoxy-substituted alkyl, aralkoxy-substituted alkyl, amino-substituted alkyl, (aryl-sub- 
stituted alkyloxycarbonylamino)-substituted alkyl. thiol-substituted alkyl, alkylsulfonyl-substituted alkyl, (hydroxy-sub- 
stituted alkyithio)-substituted alkyl, thioalkoxy-substituted alkyl. hydrocarbylacylamino-substituted alkyl. heterocydy- 
lacylamino-substituted alkyl, hydrocarbyl-substltuted-heterocydylacylamino-substituted alkyl, alkylsulfonylamino-sub- 

50 stituted alkyl, arylsulfonyl amino-substituted alkyl, morpholino-alkyi, thiomorpholino-alkyi, morpholino carbonyl-substi- 
tuted alkyl, thiomorpholinocarbonyl-substituted alkyl, [N-(alkyl, alkenyl or alkynyl)- or N.N-[dialkyl, dialkenyl, dialkynyl 
or (alkyl. alkenyl>-amino]carbonyl-substituted alkyl, heterocydylaminocartjonyl. heterocylylalkyleneaminocarbonyl. 
heterocydylaminocartwnyl-substituted alkyl. heterocylylalkyleneaminocarbonyl-substituted alkyl. N.N-[dialkyl]alkyle- 
neaminocarbonyl, N,N-[dialkyl]alkyleneaminocart)onyl-substituted alkyl. alkyl-substituted heterocydylcart)onyl, alkyl- 

55 substituted heterocydylcarbonyl-alkyl. carboxyl-substituted alkyl, dialkylamino-substituted acylaminoalkyi and amino 
acid side chains selected from arginine, asparagine, glutamine, S-methyl cysteine, methionine and corresponding 
sulfoxide and sulfone derivatives thereof, glycine, leucine, isoleucine, allo-isoleucine, tert-leucine. norieucine, pheny- 
lalanine, tyrosine, tryptophan, proline, alanine, omithine, histidine, glutamine, valine, threonine, serine, aspartic acid. 
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beta-cyanoalanine, and allothreonine; alynyl and heterocyclylcaitonyi, aminocarbonyl, amido, mono- or dialkylamino- 
carbonyl, mono- or diary! aminocarbonyl, alkylarylaminocarbonyl, diarylaminocarbonyl, mono- ordiacylaminocarbonyl, 
aromatic or aliphatic acyl. alkyi optionally substituted by substituents selected from amino, carboxy, hydroxy, mercapto. 
mono-or dialkylamino, mono-ordiarylamino. alkylarylamino. diarylamino, mono-ordiacylamino, alkoxy, alkenoxy, ary- 
5 loxy, thioalkoxy, thioalkenoxy. thioalkynoxy, thioaryloxy and heterocyclyl. 

[0106] A preferred compound of the formula T^J-T2-)n-L-X has the structure: 



T 

Amide 



O 

J5 

r 



(CH2)c 



20 



wherein G is (CH2)i^ such that a hydrogen on one and only one of the CH2 groups represented by a single "G" is 
replaced with-(CH2)c-Amide-T'*; and T'* are organic moieties of the formula C^.25No-90o-9*^a^p ^® ^""^ 

of a and p is sufficient to satisfy the otherwise unsatisfied valencies of the C, N, and O atoms; amide is 



25 



o o 

— N-C — or — C-N — ; 
30 R R 



is hydrogen or C^.^q ^'Mi c is an integer ranging from 0 to 4; and n is an integer ranging from 1 to 50 such that 
when n is greater than 1 , G, c, Amide, and T"* are independently selected. 
35 [0107] In a further preferred embodiment, a compound of the fomiula "[^(J-T^-j^-L-X has the structure: 



40 J* 



45 



Amide 

I 



[1 0 (CHa), 
^ Amdc 

55 

wherein is an organic moiety of the formula Ci.25No.90o.9HaFp such that the sum of a and p is sufficient to satisfy 
the otherwise unsatisfied valencies of the C. N, and O atoms; and includes a tertiary or quaternary amine or an 
organic acid; m is an integer ranging from 0-49, and 7^. T*. RS L and X have been previously defined. 
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[0108] Another preferred compound having the formula T2-{J-T3-)n-L-X has the particular structure: 



I 

Amide 



0 (CH2)c r' O 



10 



15 




Anude 



wherein is an organic moiety of the formula C^.25N().90o,9H„Fp such that the sum of a and p is sufficient to satisfy 
the otherwise unsatisfied valencies of the C, N, and O atoms; and includes a tertiary or quaternary amine or an 
organic acid; m is an integer ranging from 0-49, and X^, T^, c, "Amide", L and X have been previously defined. 
20 [0109] In the above structures that have a group, -Amide-T^ is preferably one of the following, which are conven- 
iently made by reacting organic acids with free amino groups extending from "G": 



25 



30 



35 



40 



""^"h""^V^ ' — NHC— ^|^)-0-(C,-C,o)-N(C,-rC,o)2 



(C,-C,o) 



:-(C,-C,o)-N^^ ^ : — NHC-(Co-C,o) 



-NHC- 

O ' — ' O 




—NHC— ^ \l- 



-(C,— C,o); and — nHC-(C,^C,o)-N; 



[0110] Where the above compounds have a group, and the 'G' group has a free carboxyl group (or reactive 
equivalent thereof), then the following are preferred -Amide-JS group, which may conveniently be prepared by reacting 
45 the appropriate organic amine with a free cartx>xyl group extending from a 'G' group: 
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N— X 

— CNH-(C,--C,o)-/ y 



CNH— (Cr-C,o>-N^ O : 



(C,— C,o) 
-CNH— (C2-C,o>-N^^ y ; 



— CNH— (C,— C,o 



^1 C|o) 

N, 



-CNH— (C2-C,o)-N(C,-C,o)2 ; — CNH— (C,— C,o):7-N ; 

o o ^ — 



-CN N(C,— C,o) ; and 




(0111] In three preferred embodiments of the invention, T-L-MOl has the structure: 



I 

Amide 



H 



I O N 

O (CH2)c r' ^(C,— C,o)— ODN— 3— oh 



Rl O 



or the structure: 
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10 



15 




NO, 



(C,— Cjo>— ODN— 3— OH 



20 



25 



30 



35 



or the stmcture: 




^(C ,— C ,o)— ODN —3— OH 



wherein and are organic moieties of the formula Ci.25No.90o.9So^Po-3H„Fpl8 such that the sum of a, p and 5 is 
sufficient to satisfy the otherwise unsatisfied valencies of the C, N, O, S and P atoms; G is (CHj)!^ wherein one and 
only one hydrogen on the CH2 groups represented by each G is replaced with -{CH2)c-Amide-T4, Amide is 



40 



O 
II 

■N-C — or 



O 
II 

— c-N— ; 



is hydrogen or C^.^q alkyi; c is an integer ranging from 0 to 4; "C2-C10" represents a hydrocarbylene group having 
45 from 2 to 10 carbon atoms, "ODN-S'-OH" represents a nucleic acid fragment having a terminal 3' hydroxyl group (i.e., 
a nucleic acid fragment joined to (C^-Cio) at other than the 3' end of the nucleic acid fragment); and n is an integer 
ranging from 1 to 50 such that when n is greater than 1. then G, c, Amide, and are independently selected. 
Preferably there are not three heteroatoms bonded to a single carbon atom. 

[01 1 2] In structures as set forth above that contain a T2-C(=0)-N(R1 y group, this group may be formed by reacting 
50 an amine of the formula HN(R^)- with an organic acid selected from the following, which are exemplary only and do 
not constitute an exhaustive list of potential organic acids: Formic acid. Acetic acid, Propiolic acid, Propionic acid, 
Ruoroaceticacid. 2-Butynoic acid. Cydopropanecarboxylicacid, Butyric acid. Methoxyacetic acid, Difluoroacetic acid. 
4-Pentynoic acid, Cyclobutanecaiboxylic acid, 3,3-Dimethylacrylic acid, Valeric add. N.N-Dimethylglycine, N-Formyl- 
Gly-OH, Ethoxyacetic acid. (Methylthio)acetic acid, Pyn'oIe-2-carboxylic acid, 3-Furoic acid, lsoxazole-5-carboxylic 
55 acid, trans-3-Hexenoic acid, Trifluoroacetic acid, Hexanoic acid, Ac-Gly-OH, 2-Hydroxy-2-methylbutyric acid, Benzoic 
acid. Nicotinic acid, 2-Pyrazinecartx)xylic acid. 1-Methyl-2-pyrrolecarboxylic acid. 2-Cyclopentene-1 -acetic acid, Cy- 
clopentylacetic acid, (S)-(-)-2-Pyrrolidone-5-carboxylic acid, N-Methyl-L-pro!ine, Heptanoic acid, Ac-b-Ala-OH, 2-Ethyl- 
2-hydroxybutyric acid, 2H(2-Methoxyethoxy)acetic acid, p-Toluic acid, 6-Methylnicolinlc acid, 5-Methyl-2-pyrazinecar- 
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boxylic acid. 2,5-Dimethylpyrrole-3-carboxy!ic acid, 4-FIuorobenzoic acid, 3,5-DimethyIiscxazole-4-carboxylic acid, 

3- CydopentylpropiGnic acid, Octanoic acid, N,N-Dimethylsuccinamic acid, Phenylpropiolic acid, Cinnamic acid, 

4- Ethylben20ic acid, p-Anisic acid, 1 ,2,5-Trimethylpyrrole-3-cart>oxylic acid, 3-Fluoro-4-fnethylben2oic acid, . . 
Ac-DL-Propargylglycine, 3-(Trifluoromethyl)butyric acid, 1-Piperidineproplonlcacid. N-Acetylproline, 3,5-Difluoroben- 
zoic acid, Ac-L-Val-OH, lndole-2-carboxylic acid, 2-Ben2ofurancarboxylic acid, Benzotrlazole-5-carboxylic acid, 4-n- 
Propylbenzoic acid, 3rDimethylaminobenzoic acid, 4-Ethoxybenzoic acid. 4-(Methylthiq)benzoic acid, N-(2-Furoy!)gly- 
cine, 2-(Methylthio)nicotinic acid. 3-Fluoro-4-methoxybenzoic acid, Tfa-Gly-OH, 2-Napthoic acid, Quinaldic acid, Ac- 
L-ile-OH, 3-Methylindene-2*carboxylicacid, 2-Quinoxalinecarboxy lie acid, 1-Methylindole-2-carboxylicacid, 2,3.6-Tri- 
fluorobenzolcacid, N-Formyl-L-Met-OH, 2-(2-(2-Methoxyethoxy)ethoxy]aceticacid, 4-n-Butylben2oicacid, N-Benzoylg- 
lycine, 5-Fluoroindole-2-carboxylic acid, 4-n-Propoxybenzoic acid, 4-AcetyI-3,5-dimethyl-2-pyrrolecarboxylic acid, 

3.5- Dimethoxybenzoicacid,2,6-Dimethoxynicotinicacid,Cyclohexanepentanoicacid,2-Naphthylaceticacid,4-(1H^ 
roI-1 -yi)benzoic acid, lndole-3-propionic acid, m-Trifluoromethylbenzoic acid, 5-Methoxyindole-2-carboxylic acid, 
4-Pentylbenzoic acid, Bz-b-Ala-OH, 4-Diethy!aminobenzoic acid, 4-n-Butoxybenzoic acid, 3-Methyl-5-CF3-isoxazole- 
4-carboxylic acid, (3,4-Dimethoxyphenyl)acetic acid, 4-Biphenylcarboxylic add, Pivaloyl-Pro-OH, Octanoyl-Gly-OH, 
(2-Naphthoxy)acetic acid, lndole-3-butyric acid, 4-(Trifluoromethyl)phenytacetic acid, 5-Methoxyindoie-3-acetic acid. 
4-(Trifluoromethoxy)benzoic acid, Ac-L-Phe-OH, 4-Pentyloxyben20ic acid, Z-Gly-OH. 4-Carboxy-N-(fur-2-yImethyl) 
pyrrolidin-2-one, 3,4-Diethoxybenzoic acid, 2,4-Dimethyl-5-C02Et-pyrTole-3-carboxylic acid, N-{2-Fluorcphenyl)sucd- 
namic add. 3,4,5-Trimethoxybenzoic add, N-Phenylanthranilic acid, 3-Phenoxyben20ic acid, Nonanoyi-Gly-OH, 

2- Phenoxypyridine-3-carboxytic acid, 2,5-Dimethyl-1-phenylpyrrole-3-carboxytlc add, trans-4-^Trifluoromethyl)cin- 
namic acid, (5-Methyl-2-phenyloxazol-4-yl)aceticacid, 4-(2-Cydohexenyloxy)ben20icacid, 5-Methoxy-2-methyllndoIe- 

3- acetic acid, trans-4-Cotinlnecarboxylic acid, B2-5-Aminovaleric acid, 4-Hexyloxyben2oic acid, N-(3-Methoxyphenyl) 
succinamic acid, Z-Sar-OH, 4-(3,4-Dimethoxyphenyl)butyric acid, Ac-o-Fluoro-DL-Phe-OH, N-(4-Fluorophenyl)gluta- 
ramic acid, 4'-Ethyl-4-biphenylcarboxyllc acid, 1 ,2,3,4-Tetrahydroacridinecarboxylicacid, 3-Phenoxyphenylacetic acid, 
N-(2,4-Difluorophenyl)succinamicacid,N-Decanoyl-Gly<)H.(+)-6-Methoxy-a-methyl-2-naphthaleneacetic^ 
fluoromethoxy)cinnamic acid, N-Formyl-DL-Trp-OH. (R)-(+)-a-Methoxy-a-(trifluoromethyl)phenylacetic acid, 
Bz-DL-Leu-OH, 4-(Trifluoromethoxy)phenoxyacetic add, 4-Heptyloxybenzoic acid, 2,3.4-Trimethoxycinnamic acid, 

2.6- Dimethoxybenzoyl-Gly-OH, 3-(3,4,5-Trimethoxyphenyl)propionic acid, 2,3,4,5,6-Pentafluorophenoxyacetic acid, 
N-(2,4-Difluorophenyl)glutaramlcacid, N-Undecanoyl-Gly-OH, 2-(4-Fluorobenzoy!)benzoic acid, 6-Trifluoromethoxyin- 
do!e-2-carboxylic acid, N-(2,4-Difluoropheny!)diglycolamicacid, Ac-L-Trp-OH, Tfa-L-Phenylglycine-OH, 3-lodobenzoic 
acid, 3-(4<i-Pentylben2oyl)propionic acid. 2-PhenyM-quinolinecarboxylic add, 4-Octyloxybenzoic acid, B2-L-Met-0H, 
3,4,5*Triethoxybenzo)C acid, N-Lauroyl-Gly-OH, 3,5-Bis(trifluoromethyl)ben2oic acid, Ac-5-Methyl-DL-Trp-OH, 2-lodo- 
phenylacetic add, 3-lodo-4-methylbenzoic acid, 3-(4-n-Hexylbenzoyl)propionic acid, N-Hexanoyl-L-Phe-OH, 4-Nony- 
loxyben2oic acid, 4*-(Trifiuoromethyl)-2-biphenylcarboxylicacid, Bz-L-Phe-OH. N-Tridecanoyl-Gly-OH, 3,5-Bis(trifluor- 
omethyl)phenylacetic acid, 3-(4-n-Hepty!benzoyl)propionic acid, N-Hepytanoyl-L-Phe-OH. 4-Decyloxybenzoic acid, 
N-(a.a,a-trifluoro-m-tolyl)anthranilic acid, Niflumic add. 4-(2-Hydroxyhexafluoroisopropyl)benzoic acid. N-Myristoyl- 
Gly-OH, 3-(4-n-Octylbenzoyl)propionic acid, N-Octanoyl-L-Phe-OH, 4-Undecyloxyben2oic acid, 3-(3,4,5-Trimethoxy- 
phenyl)propionyl-Gly-OH, 8-lodonaphthoic acid, N-Pentadecanoyl-Gly-OH, 4-DodecyIoxybenzoic acid, N-Palmitoyl- 
Gly-OH, and N-Stearoyl-Gly-OH. These organic acids are available from one or more of Advanced ChemTech, Lou- 
isville, KY; Bachem Bioscience Inc., Torrance, CA; Calbiochem-Novabiochem Corp., San Diego, CA; Farchan Labo- 
ratories inc., Gainesville FL; Lancaster Synthesis. Windham NH: and MayBridge Chemical Company (c/o Ryan Sci- 
entific). Columbia, SC. The catalogs from these companies use the abreviations which are used above to identify the 
acids. 

f. Combinatorial Chemistry as a Means lor Preparing Tags 

[0113] Combinatorial chemistry is a type of synthetic strategy which leads to the production of large chemical libraries 
(see, for example. PCT Application Publication No. WO 94/08051 ). These combinatorial libraries can be used as tags 
for the identification of molecules of interest (MOIs). Combinatorial chemistry may be defined as the systematic and 
repetitive, covalent connection of a set of different "building blocks" of varying structures to each other to yield a large 
array of diverse molecular entities. Building blocks can take many forms, both naturally occurring and synthetic, such 
as nudeophiles, electrophiles, dienes, alkylating or acylating agents, diamines, nucleotides, amino acids, sugars, lipids, 
organic monomers, synthons, and combinations of the above. Chemical reactions used to connect the building blocks 
may involve alkylation, acylation. oxidation, reduction, hydrolysis, substitution, elimination, addition, cydization, con- 
densation, and the like. This process can produce libraries of compounds which are oligomeric, non-oligomeric. or 
combinations thereof. If oligomeric, the compounds can be branched, unbranched, or cydic. Examples of oligomeric 
structures which can be prepared by combinatorial methods indude oligopeptides, oligonucleotides, oligosaccharides, 
polylipids. polyesters, polyamides, polyurethanes, polyureas, polyethers, poly(phosphorus derivatives), e.g., phos- 
phates, phosphonates. phosphoramides, phosphonamides, phosphites, phosphinamides. etc., and poly(sulfur deriv- 
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atives), e.g.. sulfones, sulfonates, sulfites, sulfonamides, sulfenamides, etc, 

[01 1 4] One common type of oligomeric combinatorial library is the peptide combinatorial library. Recent innovations 
in peptide chemistry and molecular biology have enabled libraries consisting of tens to hundreds of millions of different 
peptide sequences to be prepared and used. Such libraries can be divided into three broad categories. One category 

5 of libraries involves the chemical synthesis of soluble nonsupport-bound peptide libraries [e.g., Houghten et ai.. Nature 
- 354:84,- 1 99.1 ).. A second.category involves_the chemjcaj synthesis of support-bound peptide libraries, presented on 
solid supports such as plastic pins, resin beads, or cotton (Geysen et al., MoL Immunol. 23:709, 1986; Lam et 'al., 
Nature 354:82, 1991 ; Eichler and Houghten, Biochemistry 32:11035. 1993). In these first two categories, the building 
blocks are typically L-amino acids. D-amino acids, unnatural amino adds, or some mixture or combination thereof. A 

10 third category uses molecular biology approaches to prepare peptides or proteins on the surface of filamentous phage 
particles or plasmids (Scott and Craig, Curr. Opinion Biotech. 5:40, 1 994). Soluble, nonsupport-t)ound peptide libraries 
appear to be suitable for a number of applications, including use as tags. The available repertoire of chemical diversities 
in peptide libraries can be expanded by steps such as permethylation (Ostresh et al., Proc. Natl. Acad. ScL, USA 91: 
11138, 1994). 

15 [0115] Numerous variants of peptide combinatorial libraries are possible in which the peptide backbone is modified, 
and/or the amide trands have been replaced by mimetic groups. Amide mimetic groups which may be used include 
ureas, urethanes, and carbonyl methylene groups. Restructuring the backbone such that sidechains emanate from the 
amide nitrogens of each amino acid, rather than the alpha-carbons, gives libraries of compounds known as peptoids 
(Simon et al., Proc. Natl. Acad. Sci., USA 89:9367, 1992). 

20 [0116] Another common type of oligomeric combinatorial library is the oligonucleotide combinatorial library, where 
the building blocks are some form of naturally occurring or unnatural nucleotide or polysaccharide derivatives, including 
where various organic and inorganic groups may substitute for the phosphate linkage, and nitrogen or sulfur may 
substitute for oxygen in an ether linkage (Schneider et al., Biochem. 34:9599, 1995; Freier et al.. J. Med. Chem. 38: 
344, 1995; Frank. J. Biotechnology 47:259. 1995; Schneider et al.. Published PCT WO 942052; Ecker et al.. Nucleic 

25 Acids Res. 2f:1853. 1993). 

[01 1 7] More recently, the combinatorial production of collections of non-oligomeric. small molecule compounds has 
been described (DeWitt et al., Proc. Natl. Acad. ScL, USA 90:690, 1993; Bunin et al., Proc. Natl. Acad. ScL, USA 91: 
4708, 1994), Structures suitable for elaboration into small-molecule libraries encompass a wide variety of organic 
molecules, for example heterocyclics, aromatics, alicyclics, aliphatics, steroids, antibiotics, enzyme inhibitors, ligands. 

30 hormones, drugs, alkaloids, opioids, terpenes, porphyrins, toxins, catalysts, as well as combinations thereof. 

g. Specific Methods hr Combinatorial Synthesis of Tags 

[0118] Two methods for the preparation and use of a diverse set of amine-containing MS tags are outlined below. 
35 In both methods, solid phase synthesis is employed to enable simultaneous parallel synthesis of a large number of 
tagged linkers, using the techniques of combinatorial chemistry. In the first method, the eventual cleavage of the tag 
from the oligonucleotide results in liberatk)n of a cart)oxyl amide. In the second method, cleavage of the tag produces 
a carix)xylic acid. The chemical components and linking elements used in these methods are abbreviated as follows: 



40 


R = 


resin 




FMOC = 


fluorenylmethoxycarbonyl protecting group 




All = 


allyl protecting group 




C02H = 


carboxylic acid group 




CONH2 = 


carboxylic amide group 


45 


NH2 = 


amino group 




OH = 


hydroxyl group 




CONH = 


amide linkage 




COO = 


ester linkage 




NH2 - Rink - CO2H = 


4-[(a-amino)-2,4-dimethoxyben2yl]-phenoxybutyric acid (Rink linker) 


50 


OH-1MeO-C02H = 


(4-hydroxymethyl )phenoxybutyric acid 




OH - 2Me0 - CO2H = 


(4-hydroxymethyl-3-'methoxy)phenoxyacetic acid 




NH2-A-COOH = 


amino acid with aliphatic or aromatic amine functionality in side chain 




X1.,..Xn-C00H = 


set of n diverse cartK)xylic acids with unique molecular weights 




oligol... oligo(n) = 


set of n oligonucleotides 


55 


HBTU = 


0-ben zotri azoM -yl -N , N , N' , N'-tetramelhyluron ium hexafluorophosphate 




[0119] The sequence of steps in Method 1 is as follows: 
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OH-2MeO-CONH.R 

i FMOC - NH - Rink - CO.H; couple (e.g. , HBTU) 

_ . FMOC -NH- Rink.- COO -_2MeO-CONH.^^ 

i piperidine (remove FMOC) 
10 NH, • Rink - COO - 2MeO - CONH - R 

i FMOC • NH - A - COOH; couple (e,g. , HBTU) 
FMOC - NH . A - CONH - Rink - COO - 2MeO - CONH - R 

15 

i piperidine (remove FMOC) 

NHj . A - CONH • Rink - COO - 2MeO - CONH - R 

20 i divide into n aiiquots 

Hill couple to n different acids X I Xn - COOH 

XI Xn -CONH -A- CONH -Rink- COO- 2MeO- CONH -R 

25 4'ii4>l' Cleave tagged linkers from resin with 1 % TFA 

XI Xn - CONH " A -CONH - Rink - CO2H 

44441 couple to n oligos (oligol oligo(n)) 

30 (<?'g , via Pfp esters) 

XI Xn • CONH - A • CONH - Rink - CONH - oligol oligo(n) 

4 pool tagged oligos 

35 4 perform sequencing reaction 

4 separate different length fragments from 

sequencing reaction (e.g. , via HPLC or CE) 
4 cleave tags from linkers with 25%- 1 00% TFA 

40 XI Xn -CONH -A -CONH 

4 

analyze by mass spectrometry 

45 

[0120] The sequence of steps in Method 2 is as follows: 



50 



55 
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OH- IMeO-CO,- All 

i FMOC - NH • A • CO,H; couple (e.g., HBTU) 
FMOC.NH-A-c66-lMe6.~Ca-V» -- - . . _ . . _ 

i Palladium (remove Allyl) 
FMOC - NH - A - COO - IMeO - CO.H 

i OH - 2MeO - CONH • R; couple (e,g. , HBTU) 
FMOC . NH • A - COO - 1 MeO - COO - 2MeO - CONH • R • 

i piperidine (remove FMOC) 
NH, - A - COO - IMeO - COO - 2MeO - CONH - R 

i divide into n aliquots 
444444 couple to n different acids XI Xn - CO2H 

XI Xn - CONH - A - COO • IMeO - COO - 2MeO - CONH - R 

iiJ-ii cleave tagged linkers from resin with 1% TFA 

XI ..... Xn - CONH - A - C(X) - IMeO - COjH 

iiiii couple to n oligos (oligol oHgo(n)) 

(e.g., via Pfp esters) 

XI Xn - CONH • A - COO - IMeO - CONH - oligol oligo(n) 

i pool tagged oligos 

i perform sequencing reaction 

i separate different length fragments from 

sequencing reaction (e.g., via HPLC or CE) 
i cleave tags from linkers with 25-1 00% TFA 

XI Xn - CONH ^ A • COjH 

i 

analyze by mass spectrometry 

h. Phosphoramidite and Related Methods of Tag Synthesis 

[0121] Solid phase synthesis of natural polymers was originally developed simultaneously by Merrifield (Men*ifield, 
1963) and for peptide chemistry and subsequently adapted to oligonucfeotide synthesis by Letsinger (Letsinger and 
Mahadevan 1 965). The concept has four basic aspects: L The oligonucleotide Is synthesized while covalently attached 
to a solid support; II. Excess soluble protected nucleotides and coupling reagent can drive a reaction nearty to com- 
pletion; HI. The reaction is carried out in a single reaction vessel to diminish mechanical losses due to solid support 
manipulation allowing synthesis with minute quantities of starting materials; and IV. The heterogeneous reactions are 
standardized, and these procedures are easily automated. 

[01 22] The most widely used method for synthesizing oligonucleotides is the phosphite-triester approach. Another. 
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but less common method of synthesis is the H-phosphonate approach. 
The Phosphite-Triester Approach for Oligonucleotide Synthesis 

5 [0123] The development of this procedure was initiated in 1975, when Letsinger (Letsinger et.al. 1975) introduced 

- - the symmetrical phosphite. reagent methoxyphpsphpdichloridite. Although coupling times were dramatically reduced, 
this compound has the drawback of being too reactive, making handling very difficult at room temperature an^ storage 
of phophite monomers impossible even at -10**C. Reaction with protected nucleosides results in the production of large 
quantities of symmetrical 3'-3* dimer (Letsinger et.aK. 1982). 

10 [0124] In 1981 the introductton of the new phosphitylating agent N.N-dimethylaminomethoxyphosphine (Beaucage 
and Caruthers 1981) resolved not only the problem of the formation of 3-3' dimers during phosphitylation, but also 
resulted in generating deoxyribonucleoside phosphite derivatives which are to a certain extent stable towards oxygen 
and atmospheric moisture at room temperature. The most useful compound proved to be N.N-diisopropylamine (Adams 
et.al. 1983, McBride and Caruthers 1983) which can be purified easily on silica gel column and are stable as dry 

15 powders at room temperature. 

[0125] The 5'-protecting group used by this method is the dimethoxytrityl (DMT). This group is completely cleaved 
by treatment with trichloroacetic acid (1-3 % w/v) in dichloromethane in less than 1 minute. Once the protecting group 
is removed, the free 5'-hydroxyl is available for coupling to the next nucleoside building block. 
[01 26] Unlike the phosphodichloridites in Letsinger's approach and the phosphomonochloridite/ tetrazol in the original 

20 woric of Caruthers (Caruthers et al. 1980). the phosphoramidites cannot react directly with a free 5'-hydroxyl function 
on a growing chain. They must first be activated by treatment with a weak acid such as tetrazole. Tetrazole has been 
shown (Berner et al., 1990; Dahl et al.,1990) to play a dual role: it protonates the dialkylamino group of the phophor- 
amidite function; and next, acts as a nucleophile, generating a very reactive tetrazolophosphane intermediate. Coupling 
reactions with these deoxynucleoside-phosphoramidite reagents are very fast (less than 2 min) and almost quantitative. 

25 [0127] Since the coupling reactk)n cannot be quantitative in a finite time period, a small percentage of tmncated 
sequences is produced at every coupling step. These reaction failures contain 5'-hydroxyls. If these failure sequences 
were allowed to react further, it would be difficult to isolate the product from the sequence mixture. This problem is 
largely overcome by capping the remaining free 5'-hydroxyls by acetylation. This capping step is achieved with the 
strong acetylation reagent N-acetyl-dimethylaminopyridinium ion, which forms on reaction of equimolar amounts of 

30 acetic anhydride and 4-dimethylaminopyridine (DMAP). The reaction is neariy quantitative In 0.5 minutes. Using N- 
methyl-lmidazole instead of DMAP the oligonucleotide will have improved biological properties (Ferrance et al., 1989). 
[01 28] The newly formed phosphite intemucleotide linkage is unstable and susceptible to both acidic and basic cleav- 
age. Therefore, after capping, the trivalent phosphite triester is oxidized to a stable pentavalent phosphate triester. 
Iodine is used as a mild oxidant in basic tetrahydrofuran solution with water as the oxygen donor. The reaction is 

35 extremely fast, being quantitative in 30 seconds. Oxidation completes the nucleotide addition cycle. Chain extension 
can continue on removing the dimethoxytrityl group at the 5'-end of the growing chain and repeating another cycle of 
nucleotide addition. Completion of the cleavage from support and simultaneous base and phosphate deprotection are 
achieved by treatment with concentrated ammonium hydroxide. 

40 H-Phosphonate Method for Oligonucleotide Synthesis 

[01 29] The use of a nucleoside H-phosphonate was first reported by Todd and collaborators (Hall et al. 1 957). There- 
after. H-phosphonate chemistry remained unexplored until the 1980s. In 1985 and 1986 the resurgence of this approach 
was introduced by Garegg et al. (1985, 1986a, 1986b) and Froehler et al. (1986a and 1986 b). 
45 [0130] In this method the activable monomer is a 5*-DMT-base-protected-nucleoside S'-hydrogen-phosphonate. In 
these monomers the presence of the H-phosphonate moiety makes phosphate protection unnecessary. 
[01 31] The same base protecting groups are used as in the phosphite triester approach. Since the protection strategy 
is the same for the hydroxyl and the exocyclic amines on the heterocycles the deprotection 

[0132] H-phosphonate synthesis cycle. There is no oxidation step during chain elongation in oligonucleotide synthe- 
50 sis according to the H-phosphonate method. Oxidation is carried out at the end of the synthesis. 

[01 33] The coupling process in H-phosphonate synthesis is activated by a hindered acyl chloride, and the anhydride 
formed is used to react with a free oligonucleotide 5'-hydroxyl end, forming an H-phosphonate analog of the internu- 
cleotidic linkage. Yields are about 96-99%. PivaloyI chloride and 1-adamantane carbonyl chloride (Andrus et al, 1988) 
were reported to be the best activators. However, some side reactions between the condensing reagent and the starting 
55 material are observed during condensation, and it leads to decreasing yield of the desired compound. In particular, 
preactivation of a nudeosidic 3'-H-phosphonate followed by the addition to a OH-component, that usually takes place 
in the synthesis on polymer supports, resulted in lower yields of the H-phosphonate diesters. The other side reaction 
Is a modification of heterocyclic bases of nucleotides (acylation or phosphitylation of guanine and thymine) during 
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condensation. The capping reagent of phosphoramidite chemistry (acetic anhydride/N-methylimidazole) is not suitable 
for the H-phosphonate approach. Cyanoethyl-H-phosphonate (Gaffney and Jones. 1 988) or iso-propyl-H-phosphonate 
(Andrus et aL, 1988) activated by acyl chloride or PFPC can be used. 

[0134] After completion of the sequence all H-phosphonate bonds are simultaneously oxidized to phosphodiester 
5 linkages. Instead of oxidation with iodine H-phosphonate-deoxyribonudeotides are able to be converted into DNA- 

analogues as phosphorothioates, phosphoroamidates or phosphotriesters (Froehler, 1986c; Froehler et al.,1988). The 

advantages of this method are the increased monomer sfability, the~preparation of 35S-iabeIed oligonucleotides (Stein 

et al, 1990) and the possibility of reusing the excess of activated nucleoside that did not react (Seliger and Rosch. 

1990), H-phosphonates are commercially available from Glen Research (Herndon, VA). 
10 [01 35] The present invention provides tags that are readily incorporated into the standard oligonucleotide syntheses 

described above. The present invention provides a CMST-label for attachment to a nucleic acid during solid phase 

synthesis. The CMST-label is phosphoramidite of a CMST 

(01 36] One method according to the present invention involves attaching a CMST-label to a nucleic acid during solid 
phase synthesis. Specifically, a phosphoramidite of a CMST is condensed to a support-bound oligodeoxynucleotide. 

15 Adaptation of solid phase phosphoramidite chemistry to CMST technology simplifies purification and facilitate auto- 
mation of the procedure thus increasing throughput. This allows for easy preparationof the tagged molecules. 
[0137] As described above, the phosphoramidite method of oligonucleotide synthesis has been automated and Is 
widely used. A phosphoramidite of a CMST tag provides a convenient method of tagging oligonucleotides with CMST 
tags. According to the present invention, a phosphoramidite of a CMST Tag is condensed to a oligonucleotide chain 

20 bound to a solid support. The support jnay be of any sort useful for the solid phase synthesis of nucleic acids. The 
polynucleoside may be a ribonudeoside or a deoxyribonucleoside phosphodiesters. Also, polynucleoside analogs 
including but not limited to phosphorothioates, methylphosphonates. etc. may be employed. 
[01 38] A CMST phosphoramidite according to the present invention may be of the general stnjcture T^'IS-L-X where 
T^s is the detected by a mass spec. L is a photochemical linker and X is a moeity allowing coupling to a polynudeoside 

25 on a solid support. 

[01 39] Automated synthesis of oligonucleotides utilises phosphoramidite chemistr/ and a variety of phosphoramidite 
reagents (e.g., biotin2) have been developed which take advantage of this chemistry to label oligonudeotides (see 
Figure 24)3. Adaptation of this chemistry to allow mass tagging of solid supported oligonudeotides simplifies the pu- 
rification and facilitates automation of the procedure, thus increasing throughput. 
30 [0140] A proposed scheme starting from the tag acid already prepared is shown in Figure 25, 

2. Linkers 

[01 41] A "linker" component (or L), as used herein, means either a direct covalent bond or an organic chemical group 

35 which is used to connect a "tag" (or T) to a "molecule of interest" (or MOI) through covalent chemical bonds, in addition, 
the direct bond itself, or one or more bonds within the linker component is cleavable under conditions which allows T 
to be released (in other words, cleaved) from the remainder of the T-L-X compound (induding the MOI component). 
The tag variable component which is present within T should be stable to the cleavage conditions. Preferably, the 
deavage can be accomplished rapidly; within a few minutes and preferably within about 15 seconds or less. 

40 [0142] In general, a linker is used to connect each of a large set of tags to each of a similarly large set of MOls, 
Typically, a single tag-linker combination is attached to each MOI (to give various T-L-MOl), but in some cases, more 
than one tag-linker combination may be attached to each individual MOI (to give various (T-L)n-MOI). In another em- 
bodiment of the present invention, two or more tags are bonded to a single linker through multiple, independent sites 
on the linker, and this multiple tag-linker combination is then bonded to an individual MOI (to give various (T)n-L-MOI). 

45 [0143] After various manipulations of the set of tagged MOls, special chemical and/or physical conditions are used 
to cleave one or more covalent bonds in the linker, resulting in the liberation of the tags from the MOls. The cleavable 
bond(s) may or may not be some of the same bonds that were fomied when the tag. linker, and MOI were connected 
together. The design of the linker will, in large part, detemiine the conditions under which cleavage may be accom- 
plished. Accordingly, linkers may be identified by the deavage conditions they are particulariy susceptible too. When 

50 a linker is photolabile (i.e., prone to cleavage by exposure to actinic radiation), the linker may be given the designation 
L^o. Likewise, the designations L^^^, [tase^ \_[0]^ \_IR]^ leoz^ L^'c, and L^s may be used to refer to linkers that are 
particulariy susceptible to cleavage by acid, base, chemical oxidation, chemical reduction, the catalytic activity of an 
enzyme (more simply "enzyme"), electrochemical oxidation or reduction, elevated temperature ("thermal") and thiol 
exchange, respectively 

55 [0144] Certain types of linker are labile to a single type of deavage condition, whereas others are labile to several 
types of deavage conditions. In addition, in linkers which are capable of bonding multiple tags (to give (T)n-L-MOI type 
structures), each of the tag-t>onding sites may be labile to different deavage conditions. For example, in a linker having 
two tags bonded to it, one of the tags may be labile only to base, and the other labile only to photolysis. 
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[0145] A linker which is useful in the present invention possesses several attributes: 

1) The linker possesses a chemical handle (L^) through which it can be attached to an MOI. 

2) The linker possesses a second, separate chemical handle (L^) through which the tag is attached to the linker. 
5 If multiple tags are attached to a single linker ((T)n-L>MOI type structures), then a separate handle exists for each 

tag. . _ _ 

3) The linker is stable toward all manipulations to which it is subjected, with the exception of the conditions which' 
allow cleavage such that a T-containing moiety is released from the remainder of the compound, including the 
MOi. Thus, the linker is stable during attachment of the tag to the linker, attachment of the linker to the MOI, and 

10 any manipulations of the MOI while the tag and linker (T-L) are attached to it 

4) The linker does not significantly interfere with the manipulations performed on the MOI while the T-L Is attached 
to it. For instance, if the T-L is attached to an oligonucleotide, the T-L must not significantly Interfere with any 
hybridization or enzymatic reactions (e.g., PGR) performed on the oligonucleotide. Similariy. if the T-L is attached 
to an antibody, it must not significantly interfere with antigen recognition by the antibody 

15 5) Cleavage of the tag from the remainder of the compound occurs in a highly controlled manner, using physical 

or chemical processes that do not adversely affect the detectability of the tag. 

[0146] For any given linker, it is preferred that the linker be attachable to a wide variety of MOts. and that a wide 
variety of tags be attachable to the linker. Such flexibility is advantageous because It allows a library of T-L conjugates, 
20 once prepared, to t>e used with several different sets of MOIs. 
[0147] As explained above, a preferred linker has the formula 

Lh-L^-L^-L^-L, 

25 

wherein each Lf, is a reactive handle that can be used to link the linker to a tag reactant and a molecule of interest 
reactant. L^ is an essential part of the linker, because L^ imparts lability to the linker. L^ and L^ are optional groups 
which effectively serve to separate from the handles L^,. 

[0148] L^ (which, by definition, is nearer to T than is L^), serves to separate T from the required labile moiety L^. This 
30 separation may be useful when the cleavage reaction generates particularly reactive species (e.g. . firee radicals) which 
may cause random changes in the structure of the T-containing moiety. As the deavage site is further separated from 
the T-containing moiety, there is a reduced likelihood that reactive species formed at the cleavage site will disrupt the 
structure of the T-containing moiety. Also, as the atoms in LI will typically be present in the T-containing moiety, these 
L*" atoms may impart a desirable quality to the T-containing moiety. For example, where the T-containing moiety is a 
35 T'^^-containing moiety, and a hindered amine is desirably present as part of the structure of the T>^®-containing moiety 
(to serve, e.g., as a MSSE), the hindered amine may be present in L^ labile moiety. 

[01 49] In other instances. L^ and/or L^ may be present in a linker component merely because the commercial supplier 
of a linker chooses to sell the linker in a form having such a L^ and/or L^ group. In such an instance, there is no harm 
in using linkers having L^ and/or L^ groups, (so long as these group do not inhibit the cleavage reaction) even though 
40 they may not contribute any particular performance advantage to the compounds that incorporate them. Thus, the 
present invention allows for L^ and/or L^ groups to be present In the tinker component 

[01 50] L^ and/or L^ groups may be a direct bond (in which case the group is effectively not present), a hydrocarbylene 
group (e.g., alkylene, arylene, cycloalkylene. etc.). -O-hydrocarbylene (e.g.. -O-CH2-, 0-CH2CH(CH3)-, etc.) or hydro- 
carbylene-(0-hydrocarbylene)w- wherein w is an integer ranging firom 1 to about 10 (e.g., -CH2-0-Ar-, 

45 -CH2-(0-CH2CH2)4-. etc.). 

- [01 51] With the advent of solid phase synthesis, a great body of literature has developed regarding linkers that are 
labile to specific reaction conditions. In typical solid phase synthesis, a solid support is bonded through a labile linker 
to a reactive site, and a molecule to be synthesized is generated at the reactive site. When the molecule has been 
completely synthesized, the solid support-linker-molecule construct is subjected to cleavage conditions which releases 

50 the molecule from the solid support. The labile linkers which have been developed for use in this context (or which 
may be used in this context) may also be readily used as the linker reactant in the present Invention. 
[0152] Lloyd-Williams, P., et at, "Convergent Solid-Phase Peptide Synthesis", Tetrahedron Report No. 347, 49(48): 
11065-11133 (1993) provides an extensive discussion of linkers which are labile to actinic radiation (i.e., photolysis), 
as well as acid, base and other cleavage conditions. Additional sources of information about labile linkers may be 

55 readily obtained. 

[0153] As described above, different linker designs will confer cleavabillty ("lability") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid, base, 
oxidation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 
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[0154] Examples of cleavable linkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (Rockford, IL). Examples include: 

• ethylene glycobis(succinimidylsuccinate) (EGS), an amine reactive cross-linking reagent which is cleavable by 
5 hydroxylamine (1 M at 37°C for 3-6 hours); 

- • disuccinimidyl tartarate (DST) and sulfo-DST. whLch_are amine reactive cross-linking reagents, cleavable by 0.01 5 
M sodium periodate; 

• bis{2-(succinimidyloxycarbonyloxy)ethyl]sulfone (BSOCOES) and sulfo-BSOCOES, which are amine reactive 
cross-linking reagents, cleavable by base (pH 11.6); 

10 • ^ ,4-di-[3*-(2'-pyridytdithio(propionamldo))butane (DPDPB), a pyridyldithtol crosslinker which is deavable by thiol 
exchange or reduction; 

N-[4-{p-azidosaIicylamido)-butyl)-3 '(2'-pyridydithio)propionamide (APDP), a pyridyldithiol crosslinker which is 
cleavable by thiol exchange or reduction; 

• bis-[beta-4-(azidosalicylamido)ethyl]-disulfide. a photoreactive crosslinker which is cleavable by thiol exchange or 
^5 reduction; 

N-succinimidyl-(4-azidophenyl)-1 ,3'dithiopropionate (SADP). a photoreactive crosslinker which is cleavable by thi- 
ol exchange or reduction; 

• sulfosuccinimidyl-2-(7-azido-4-methylcoumarin-3-acetamide)ethyl-1 ,3*-dithiopropionate (SAED), a photoreactive 
crosslinker which is cleavable by thiol exchange or reduction; 

20 • sulfosuccinimidyl-2-(m-azido-o-nitrobenzamido)-ethyl-1,3'dithiopropionate (SAND), a photoreactive crosslinker 
which is cleavable by thiol exchange or reduction. 

[0155] Other examples of cleavable tinkers and the cleavage conditions that can be used to release tags are as 
follows. A silyl linking group can be cleaved by fluoride or under acidic conditions. A 3-, 4-, 5-, or 6-substituted-2-ni- 

25 trobenzytoxy or 2-, 3-. 5-, or 6-substituted^-nitrobenzyloxy linking group can be cleaved by a photon source (photol- 
ysis), A 3-. 4-, 5-, or 6-substituted-2-alkoxyphenoxy or 2-, 3-, 5-, or 6-substituted-4-alkoxyphenoxy linking group can 
be cleaved by Ce(NH4)2(N03)6 (oxidation). A NCO2 (urethane) linker can be cleaved by hydroxide (base), acid, or 
LiAlH4 (reduction). A 3-penteny!, 2-butenyl, or 1-butenyl linking group can be cleaved by O3, OSO4/IO4-, or KMn04 
(oxidation). A 2-[3-, 4-, or 5-substituted-furyl]oxy linking group can be cleaved by O2, Brj, MeOH, or acid. 

30 [01 56] Conditions for the cleavage of other labile linking groups include: t-alkytoxy linking groups can be cleaved by 
acid; methyl(dialkyl)methoxy or 4-substituted-2-alkyl-1 ,3-dioxlane-2-yl linking groups can be cleaved by H3O; 2-si- 
lylethoxy linking groups can be cleaved by fluoride or acid; 2-(X)-ethoxy (where X = keto, ester amide, cyano, NO2, 
sulfide, sulfoxide, sulfone) linking groups can be cleaved under alkaline conditions; 2-, 3-, 4-, 5-, or 6-substituted- 
benzyloxy linking groups can be cleaved by acid or under reductive conditions; 2-butenyloxy linking groups can be 

35 cleaved by (Ph3P)3RhCI(H), 3-, 4-, 5-, or 6-substituted-2-bromophenoxy linking groups can be cleaved by Li, Mg, or 
BuLI; methylthiomethoxy linking groups can be cleaved by Hg^-; 2-(X)-ethyloxy (where X = a halogen) linking groups 
can be cleaved by Zn or Mg; 2-hydroxyethyloxy linking groups can be cleaved by oxidation {e.g., with Pb(0Ac)4). 
[01 57] Preferred linkers are those that are cleaved by acid or photolysis. Several of the acid-labile linkers that have 
been developed for solid phase peptide synthesis are useful for linking tags to MOIs. Some of these linkers are de- 

^0 scribed in a recent review by Lloyd-Williams et al. (Tetrahedron 49:11065-11133, 1993). One useful type of linker is 
based upon p-alkoxybenzyl alcohols, of which two, 4-hydroxymethylphenoxyacetic acid and 4-(44iydroxymethyl- 
3-methoxyphenoxy)butyric acid, are commercially available from Advanced ChemTech (Louisville. KY). Both linkers 
can be attached to a tag via an ester linkage to the benzylalcohol. and to an amine-containing MOI via an amide linkage 
to the carboxylic acid. Tags linked by these molecules are released from the MOI with varying concentrations of trif- 

45 luoroacetic acid. The cleavage of these linkers results In the liberation of a carboxylic add on the tag. Acid deavage 
of tags attached through related linkers, such as 2,4-dimethoxy-4'-(carboxymethyloxy)-benzhydrylamine (available 
from Advanced ChemTech in FMOC-protected form), results in liberation of a carboxylic amkje on the released tag. 
[01 58] The photolabile linkers useful for this application have also been for the most part developed for solid phase 
peptide synthesis (see Lloyd-Will lams review). These linkers are usually based on 2-nltrobenzylesters or 2-nltroben- 

50 zylamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1 -Fmoc- 
amino)ethyl)-2-methoxy-5-nltrophenoxy)butanoic acid (Holmes and Jones, J. Org, Chem. 60:2318-2319, 1995) and 
3-(Fmoc-amino)-3-(2-nitrophenyl)propionicacid (Brown et a!., Molecular Diversity 1:4-12, 1995). Both linkers can be 
attached via the carboxylic acid to an amine on the MOI. The attachment of the tag to the linker is made by forming 
an amide between a carboxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usually 

55 perfomied with UV light of 350 nm wavelength at intensities and times known to those in the art. Cleavage of the linkers 
results In liberation of a primary amide on the tag. Examples of photodeavable linkers include nitrophenyl glycine 
esters, exo- and endo-2-benzonort)omeyl chlorides and methane sulfonates, and 3-amino-3(2-nitrophenyl) propionic 
acid. Examples of enzymatic deavage indude esterases which will deave ester bonds, nucleases which will deave 
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phosphodiester bonds, proteases which cleave peptide bonds, etc. 

[0159] A preferred linker component has an ortho-nilrobenzyl structure as shown below: 




15 wherein one cart>on atom at positions a, b. c, d or e Is substituted with -L^-X, and (which is preferably a direct bond) 
is present to the left of N(R^) in the above structure. Such a linker component is susceptible to selective photo-induced 
cleavage of the bond between the carbon labeled "a" and N(R^). The identity of is not typically critical to the cleavage 
reaction, however R^ is preferably selected from hydrogen and hydrocarbyl. The present invention provides that in the 
above structure, -N(R1)- could be replaced with -0-. Also in the above structure, one or more of positions b, c, d or e 

20 may optionally be substituted with alkyi, alkoxy, fluoride, chloride, hydroxyl. carboxylate or amide, where these sub- 
stituents are Independently selected at each occurrence. 

[0160] A further preferred linker component with a chemical handle has the following structure: 



d 
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35 wherein one or more of positions b, c, d or e is substituted with hydrogen, alkyi, alkoxy, fluoride, chloride, hydroxyl, 
carboxylate or amide. is hydrogen or hydrocarbyl, and is -OH or a group that either protects or activates a 
carix>xyljc acid for coupling with another moiety. Fluorocarbon and hydrofluorocartK)n groups are preferred groups that 
activate a caitoxytic acid toward coupling with another moiety. 

^0 3. Molecule of Interest (MQI) 

[0161] Examples of MOIs include nucleic acids or nucleic add analogues (e.g., PNA), fragments of nucleic acids (/. 
e., nucleic acid fragments), synthetic nucleic acids or fragments, oligonucleotides (e.g., DNA or RNA), proteins, pep- 
tides, antibodies or antibody fragments, receptors, receptor ligands, members of a ligand pair, cytokines, hormones, 

45 oligosaccharides, synthetic organic molecules, drugs, and combinations thereof. 

[0162] Preferred MOIs include nucleic acid fragments. Preferred nucleic acid fragments are primer sequences that 
are complementary to sequences present in vectors, where the vectors are used for base sequencing. Preferably a 
nucleic acid fragment is attached directly or indirectly to a tag at other than the 3* end of the fragment; and most 
preferably at the 5' end of the fragment. Nucleic acid fragments may be purchased or prepared based upon genetic 

50 databases (e.g., Dib et al., Nature 380:152-154, 1996 and CEPH Genotype Database, http://wvyw.cephb.fr) and com- 
mercial vendors (e.g., Promega, Madison, Wl). 

[0163] As used herein, MOI includes derivatives of an MOI that contain functionality useful in joining the MOi to a 
T-L-Lh compound. For example, a nucleic add fragment that has a phosphodiester at the 5* end, where the phosphodi- 
ester is also bonded to an alkyleneamine, is an MOI. Such an MOI is described in, e.g., U.S. Patent 4,762,779 which 
55 is incorporated herein by reference. A nudeic acid fragment with an internal modification is also an MOI. An exemplary 
internal modification of a nudeic acid fragment is where the base (e.g., adenine, guanine, cytosine, thymidine, uracil) 
has been modified to add a reactive functional group. Such intemally modified nudeic acid fragments are commercially 
available from. e.g.. Glen Research. Hemdon, VA. Another exemplary intemal modification of a nudeic acid fragment 
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is where an abasic phosphoramidate is used to synthesize a modified phosphodiester which is interposed between a 
sugar and phosphate group of a nucleic acid fragment. The abasic phosphoramidate contains a reactive group which 
allows a nucleic acid fragment that contains this phosphoramidate-derived moiety to be joined to another moiety, e.g., 
a T-L-Lf, compound. Such abasic phosphoramldates are commercially available from, e.g., Clonetech Laboratories, 
5 Inc.. Palo Alto, CA. 

4. Chemical Handles (U ) 

[0164] A chemical handle is a stable yet reactive atomic arrangement present as part of a first molecule, where the 
10 handle can undergo chemical reaction with a complementary chemical handle present as part of a second molecule, 
so as to form a covalent bond between the two molecules. For example, the chemical handle may be a hydroxyl group, 
and the complementary chemical handle may be a carboxylic acid group (or an activated derivative thereof, e.g., a 
hydrofluroaryl ester), whereupon reaction between these two handles forms a covalent bond (specifically, an ester 
group) that joins the two molecules together. 
^5 [0165] Chemical handles may be used in a large number of covalent bond-forming reactions that are suitable for 
attaching tags to linkers, and linkers to MOls. Such reactions include alkylation (e.g., to form ethers, thioethers), acyla- 
tion (e.g„ to form esters, amides, carbamates, ureas, thioureas), phosphorylation (e.g., to fonm phosphates, phospho- 
nates, phosphoramides. phosphonamides), sulfonylation (e.g., to form sulfonates, sulfonamides), condensation (e.g., 
to form imines. oxtmes. hydrazones), silylation, disulfide formation, and generation of reactive intermediates, such as 
20 nitrenes or carbenes, by photolysis. In general, handles and bond-fonming reactions which are suitable for attaching 
tags to linkers are also suitable for attaching linkers to MOls, and vice-versa. In some cases, the MOI may undergo 
prior modification or derivitization to provide the handle needed for attaching the linker. 

[0166] One type of bond especially useful for attaching linkers to MOls is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOI. Mild oxidizing conditions 

25 then suffice to bond the two thiols together as a disulfide. Disulfide formation can also be induced by using an excess 
of an appropriate disulfide exchange reagent, e.g., pyridyl disulfides. Because disulfide formation Is readily reversible, 
the disulfide may also be used as the cleavable bond for liberating the tag, if desired. This is typically accomplished 
under similarly mild conditions, using an excess of an appropriate thiol exchange reagent, e.g., dithiothreitol. 
[01 67] Of particular interest for linking tags (or tags with linkers) to oligonucleotides is the formation of amide bonds. 

30 Primary aliphatic amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such 
as 6-monomethoxytritylhexylcyanoethyl-N.N-diisopropyl phosphoramidite (available from Glenn Research, Steding, 
VA). The amines found on natural nucleotides such as adenosine and guanosine are virtually unreactive when com- 
pared to the introduced primary amine. This difference in reactivity forms the basis of the ability to selectively fonn 
amides and related bonding groups (e.g., ureas, thioureas, sulfonamides) with the introduced primary amine, and not 

35 the nucleotide amines. 

[0168] As listed in the Molecular Probes catalog (Eugene, OR), a partial enumeration of amine-reactive functional 
groups includes activated cartjoxylic esters, isocyanates, isothiocyanates, sulfonyl halides, and dichlorotriazenes. Ac- 
tive esters are excellent reagents for amine modification since the amide products formed are very stable. Also, these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides. 
<o Examples of active esters include N-hydroxysucclnimlde esters, pentafluorophenyl esters, tetrafluorophenyl esters, 
and p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains 
a carboxylic acid. Methods to make active esters are listed in Bodansky (Principles of peptide Chemistry (2d ed.), 
Springer Verlag, London, 1993). 

[0169] The "X" group in molecules designated by P"s-L-X serves as a chemical handle which allows the molecule 
45 to be joined to a biomolecule. e.g., a nucleic acid molecule. The X group is a phosphoramidite or H-phosphonate group. 
With X being anyone of these two functk)nalities. then the T-L-X molecule may be added to the end of an oligonucleotide 
that has been synthesized by any of the well-known phosphoramidite (also known as phosphite-triester), or H-phos- 
phonate synthetic melhodoigies for oligonucleotide synthesis. 

[0170] For instance, where X is a phosphoramidite group, the T-L-X molecule may have the structure 

50 

I 

55 NR2 

where a preferred T-L-X molecule with X being a phosphoramidite has the structure 
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T— CHj— CONH— (CH2)6— ^OR 

1 

NR2 

[0171] In the above phosphoramidite-containing T-L-X molecules, R Is typically an alkyl group, such as a C1-C6 alkyi 
group, or an alkyl group having a substituent in place of a hydrogen of the alkyl group, where suitable substituents 
include cyano (CN) group. Thus, "OR" in the phosphoramidite may be OCH2CH2CN, and NR2 may be N(isopropyl)2, 
which are two groups commonly employed in preparing oligonucleotides using phosphoramidite chemistry. NR2 may 
altematively be. for example, a morpholine group. In one embodiment, R is an alkyl group or a substituted alkyl group 
having one or more substituents selected from halogen and cyano, and the two R groups of NR2 may be bonded 
together to form a cydoalkyi group. 

[0172] A 2nd approach to synthesizing oligonucleotides takes advantage of H-phosphonate chemistry. A T-L-X mol- 
ecule of the invention may have a chemical handle / X group that is an H-phosphonate group, and thus have a structure 
as follows: 

O 

T— L— O— P-0 R3NH 
H 

In the above H-phosphonate group, R3 represents three alkyl groups, typically having 1-6 cartion atoms in each alkyl 
group. Ethyl is a common R group in H-phosphonate reagents used in oligonucleotide synthesis, and therefore is a 
preferred R group in T-L-X molecules of the invention wherein X is an H-phosphonate group, 

5. Linker Attachment 

[0173] Typically, a single type of linker is used to connect a particular set or family of tags to a particular set or family 
of MOIs, In a preferred embodiment of the invention, a single, uniform procedure may be followed to create all the 
various T-L-MOl structures. This is especially advantageous when the set of T-L-MOl structures is large, because it 
allows the set to be prepared using the methods of combinatorial chemistry or other parallel processing technology. 
In a similar manner, the use of a single type of linker allows a single, uniform procedure to be employed for cleaving 
all the various T-L-MOl structures. Again, this is advantageous for a large set of T-L-MOl structures, because the set 
may be processed in a parallel, repetitive, and/or automated manner. 

[0174] There are, however, other emlwdiment of the present invention, wherein two or more types of linker are used 
to connect different subsets of tags to corresponding subsets of MOIs. In this case, selective cleavage conditions may 
be used to cleave each of the linkers independently, without cleaving the linkers present on other subsets of MOIs. 
[0175] A large number of covalent bond-fomiing reactions are suitable for attaching tags to linkers, and linkers to 
MOIs. Such reactions include alkyiation (e.g., to fomi ethers, thioethers). acylation (e.g., to form esters, amides, car- 
bamates, ureas, thioureas), phosphorylation (e.g., to form phosphates, phosphonates, phosphoramides, phosphona- 
mides), sulfonylation (e.g., to form sulfonates, sulfonamides), condensation (e.g., tofomn imines, oximes, hydrazones), 
silylation. disulfide formation, and generation of reactive intermediates, such as nilrenes or carbenes, by photolysis. 
In general, handles and tx)nd-forming reactions which are suitable for attaching tags to linkers are also suitable for 
attaching linkers to MOIs, and vice-versa. In some cases, the MOl may undergo prior modification or derivitlzation to 
provide the handle needed for attaching the linker 

[0176] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOl. Mild oxidizing conditions 
then suffice to bond the two thiols together as a disulfide. Disulfide formation can also be induced by using an excess 
of an appropriate disulfide exchange reagent, e.g., pyridyl disulfides. Because disulfide fomnation is readily reversible, 
the disulfide may also be used as the cleavable bond for liberating the tag, if desired. This is typically accomplished 
under simiiariy mild conditions, using an excess of an appropriate thiol exchange reagent, e.g., dithiothreitol. 
[0177] Of particular interest for linking tags to oligonucleotides is the formation of amide bonds. Primary aliphatic 
amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such as 6-monometh- 
oxytritylhexylcyanoethyl-N.N-diisopropyl phosphoramidite (available from Glenn Research, Sterling, VA). The amines 
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found on natural nucleotides such as adenosine and guanosine are virtually unreactive when compared to the intro- 
duced primary amine. This difference in reactivity forms the basis of the ability to selectively form amides and related 
bonding groups (e.g., ureas, thioureas, sulfonamides) with the introduced primary amine, and not the nucleotide 
amines. 

5 [0178] As listed in the Molecular Probes catalog (Eugene. OR), a partial enumeration of amlne-reactive functional 
groups includes activated cartx)xylic esters, isocyanates, Isothiocyanates, sulfonyl halides, and dichiorotriazenes. Ac- 
tive esters are excellent reagents for amine modification since the amide products formed are very stable. Also, these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides. 
Examples of active esters include N-hydroxysuccinimide esters, pentafluorophenyl esters, tetrafluorophenyl esters. 

10 and p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains 
a carboxylic acid. Methods to make active esters are listed in Bodansky (Principles of Peptide Chemistry (2d ed.). 
Springer Veriag, London, 1993). 

{01 79] Numerous commercial cross-linking reagents exist which can serve as linkers (e.g., see Pierce Cross-linkers, 
Pierce Chemical Co., Rockford, IL). Among these are homobifunctional amine-reactive cross-linking reagents which 

t5 are exemplified by homobifunctional imidoesters and N-hydroxysuccinimidyl (NHS) esters. There also exist heterobi- 
functional cross-linking reagents possess two or more different reactive groups that allows for sequential reactions. 
Imidoesters react rapidly with amines at alkaline pH, NHS-esters give stable products when reacted with primary or 
secondary amines. Maleimides, alkyl and aryl halides, alpha-haloacyls and pyridyl disulfides are thiol reactive. Male- 
imides are specific for thiol (sulfhydryl) groups in the pH range of 6.5 to 7.5, and at alkaline pH can become amine 

20 reactive. The thioether linkage is stable under physiological conditions. Alpha-haloacetyl cross-linking reagents contain 
the lodoacetyl group and are reactive towards sulfhydryls. Imidazoles can react with the iodoacetyf moiety, but the 
reaction is very slow. Pyridyl disulfides react with thiol groups to fonn a disulfide bond. CartK)diimides couple carboxyls 
to primary amines of hydrazides which give rises to the fomiation of an acyl-hydrazine bond. The arylazides are pho- 
toaffinity reagents which are chemically inert until exposed to UV or visible light. When such compounds are photolyzed 

25 at 250-460 nm, a reactive aryl nitrene is formed. The reactive aryl nitrene is relatively non-specific. Glyoxals are reactive 
towards guanidinyl portion of arginine. 

[0180] In one typical embodiment of the present invention, a tag is first bonded to a linker, then the combination of 
tag and linker is bonded to a MOI, to create the structure T-L-MOI. Alternatively, the same structure is formed by first 
bonding a linker to a MOI, and then bonding the combination of linker and MOI to a tag. An example is where the MOI 

30 Is a DNA primer or oligonucleotide. In that case, the tag is typically first bonded to a linker, then the T-L is bonded to 
a DNA primer or oligonucleotide, which is then used, for example, in a sequencing reaction. 
[0181] One useful fomn in which a tag could be reversibly attached to an MOI (e.g.. an oligonucleotide or DNA se- 
quencing primer) is through a chemically labile linker. One preferred design for the linker allows the linker to be cleaved 
when exposed to a volatile organic acid, for example, trifluoroacetic acid (TFA). TFA in particular is compatible with 

35 most methods of MS ionization, including electrospray 

[01 82] As described in detail below, the invention provides methodology for genotyping. A composition which is useful 
in the genotyping method comprises a purality of compounds of the formula: 

40 t"^-l.moi 

wherein, 

[0183] is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula. L is an 

45 organic group which allows a T^s_containing moiety to be cleaved from the remainder of the compound, wherein the 
P"s^ntaining moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quatemary amine and organic acid, in the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3* end of the 
MOI. In the composition, at least two compounds have the same T^^ but the MOI groups of those molecules have 

50 non-identical nucleotide lengths. 

[01 84] Another composition that is useful in the genotyping method comprises a plurality of compounds of the formula: 

t"'^-l-moi 

55 

wherein T^^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T^-containing moiety to be cleaved from the remainder of the compound, wherein the 
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T™-containing moiety comprises a functional group which supports a single ionized charge state when the compound 
Is subjected to mass spectrometry and is selected from tertiary amine, quatemary amine and organic acid. In the 
formula. MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOI. In the composition, at least two compounds have the same but those compounds have non-identical elution 
5 times by column chromatography. 

[0185] Another composition^ that may be used in the geriotyping method comprises a plurality of compounds of the 
formula: 

T^'^-L-MOI 

wherein P"^ is an organic group detectable by mass spectrometry, comprising cartjon. at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a "P"s-containing moiety to be cleaved from the remainder of the compound, wherein the 

15 T^s-containing moiety comprises a functional group which supports a single ionized charge state when the compound 
Is subjected to mass spectrometry and is selected from tertiary amine, quatemary amine and organic acid. In the 
formula. MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOI. In the composition, no two compounds which have the same MOI nucleotide length also have the same T^. 
[0186] In the above composition, the plurality is preferably greater than 2, and preferably greater than 4. Also, the 

20 nudeic acid fragment in the MOt have a sequence complementary to a portion of a vector, wherein the fragment is 
capable of priming polynucleotide synthesis. Preferably, the 1™ groups of members of the plurality differ by at least 2 
amu, and may differ by at least 4 amu. 

[0187] The invention also provides for a composition comprising a plurality of sets of compounds, each set of com- 
pounds having the fomnula: 

25 

T'^-L-MOI 

wherein T"^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
30 fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T^s^ntalning moiety to be cleaved from the remainder of the compound, wherein the 
T'"5-containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and Is selected from tertiary amine, quatemary amine and organic acid. Also, in the 
fomnula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
35 MOI. In the composition, members within a first set of compounds have identical Tms groups, however have non- 
identical MOI groups with differing numbers of nucleotides in the MOI and there are at least ten members within the 
first set, wherein between sets, the "P"* groups differ by at least 2 amu. The plurality is preferably at least 3. and more 
preferably at least 5. 

[0188] The invention also provides for a composition comprising a plurality of sets of compounds, each set of oom- 
40 pounds having the formula 

T^-L-MOI 

^5 wherein, T^^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T^-containing moiety to be cleaved from the remainder of the compound, wherein the 
P"s^^ntaining moiety comprises a functional group which supports a single Ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quatemary amine and organic acid. In the 

50 formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOI. In the composition, the compounds within a set have the same elution time but non-identical 1™ groups. 
[0189] In addition, the invention provides a kit for genotyping. The kit comprises a plurality of amplification primer 
pairs, wherein at least one of the primers has the formula: 

T'^-L-MOI 

wherein T^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
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nuorlde. and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group \A^hich allows a P^-containing moiety to be cleaved from the remainder of the compound, wherein the 
pns^ntaining moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quatemary amine and organic acid. In the 
5 formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3* end of the 
MOI; and each primer pair associates with a different locL In the kit. the pluality js preferably at least 3. and more 
preferably at least 5. 

[0190] As noted above, the present invention provides compositions and methods for determining the sequence of 
nucleic acid molecules. Briefly, such methods generally comprise the steps of (a) generating tagged nucleic acid frag- 
to ments which are complementary to a selected nudeic acid molecule (e.g.. tagged fragments) from a first tenminus to 
a second terminus of a nucleic acid molecule), wherein a tag is correlative with a particular or selected nucleotide, and 
may be detected by any of a variety of methods, (b) separating the tagged fragments by sequential length, (c) deaving 
a tag from a tagged fragment, and (d) detecting the tags, and thereby determining the sequence of the nudeic acid 
molecule. Each of the aspects will be discussed in more detail below. 

15 

B. DIAGNOSTIC METHODS 
1 . Introduction 

20 [0191] As noted above, the present invention also provides a wide variety of methods wherein the above-described 
tags and/or linkers may be utilized in place of traditional labels (e.g.. radioactive or enzymatic), in order to enhance 
the spedficity, sensitivity, or number of samples that may be simultaneously analyzed, within a given method. Repre- 
sentative examples of such methods which may be enhanced indude, for example, RNA amplification (see Lizardi et 
aL, Bio/Technology 6:1197-1202, 1988; Kramer et al.. Nature 339:401-402, 1989; Lomeli et al.. Clinical Chem. 35(9): 

25 1826-1831, 1989; U.S. Patent No. 4,786.600), and DNA amplification utilizing LCR or polymerase chain reaction 
("PGR") (see, U.S. Patent Nos. 4,683,195. 4.683,202, and 4.800.159), 

[01 92] The CMST technology platform can be utilized in a number of applications in which nucleic acid measurements 
are made on a large scale. This technology platform can be used with or without a separation or sizing methodology. 
An example of a non-sizing assay would include single nucleotide polymorphisms (SNP) assays in which oligonude- 

30 otides are used to detect the presence or absence of a base change in a target nudeic acid. Alternatively, an HPLC 
or separation system can be appended to the mass spectrometry detector (MSD) in which nudeic acid fragments can 
be sorted by size and thus a mass spectrometer tag is combined with a retention time to identify a sequence. HPLC 
of nucleic acids (Huber et al, 1993, AnaLBiochem., 212, p351; Ruber et al.. 1993, Nuc. Acids Res,, 21, p1061; Huber 
et al., 1993, Biotechniques, 16, p898) or Denaturing HPLC (DHPLC) is a method generally successful at separating 

35 DNA duplexes that differ in the identity of one or more base pair and is thus useful for scanning for mutations. A general 
method has been developed using 100 mM triethylammonium acetate as ion-pairing reagent in which oligonudeotides 
could be successfully separated on alkylated non-porous 2.3 jiM poly(styrene-divinylbenzene) via HPLC (Oefner et 
al., 1 994, Anal. Biochem., 223, p39). The technique also allowed the separation of PCR products differing only 4 to 8 
base pairs in length within a size range of 50 to 200 nudeotides. 

40 [0193] DHPLC has significantly accelerated the search for SNPs (Oefner and Undertiill, Am. 1 Hum, Genetics, 57. 
A266, 1995). Numerous applications of the DHPLC technique indude the identification of polymorphisms on the human 
Y chromosome to facilitate evolutionary studies (PNAS USA, 93. 196-200, 1996). and the rapid identification of disease 
causing mutations on chromosome 19 that causes ataxia (Cell, 87, 543-552, 1996). 

[0194] While it has been difficult to devise genetic tests for multifactorial diseases, more than 200 known human 
45 disorders are caused by a defect in a single gene, often a change of a single amino add residue (Olsen. Biotechnology: 
An industry comes of age. National Academic Press. 1 986). 

[0195] Sensitive mutation detection techniques offer extraordinary possibilities for mutation screening. Efficient ge- 
netic tests may also enable screening for oncogenic mutations in cells exfoliated from the respiratory tract or the bladder 
in connection with health checkups (Sidransky et aL, Science 252:706, 1991). Also, when an unknown gene causes 

50 a genetic disease, methods to monitor DNA sequence variants are useful to study the inheritance of disease through 
genetic linkage analysis. Several different approaches have been pursued, but none are both effident and inexpensive 
enough for truly widescale application (Cotton, RGH. (1997). Mutation Detection, Oxford University Press, New Yori<). 
Mutations involving a single nudeotkle can be Identified in a sample by physical, chemical, or enzymatic means. Gen- 
erally, methods for mutation detection may be divided into scanning techniques (Fearon, 1997, Science, 278, 

55 pi 043-1 050) which are suitable to identify previously unknown mutations, and techniques designed to detect, distin- 
guish, or quantify known sequence variants (Holtzman et. Al., 1997, Science, 278, 602-605). 
[0196] Several scanning techniques for mutation detection have been developed for heteroduplexes where the pres- 
ence of a mismatch induces abnormal behavior when the duplex is partially denatured. This phenomenon Is exploited 
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in denaturing and temperature gradient gel electrophoresis (DGGE and TGGE, respectively) methods. Duplexes mis- 
matched in even a single nucleotide position can partially denature, resulting in retarded migration, when electro- 
phoresed in an increasingly denaturing gradient gel (Orita. Genomics 5:874. 1989; Keen. Trends Genet, 7:5, 1991., 
Myers et al.. Nature 373:495, 1985; Abrams et al,. Genomics 7:463. f 990; Hence et al., Nud, Adds Res. t8:6733, 

5 1990). Although mutations may be detected, no information is obtained regarding the precise location or sequence 

. _ around the. mutation. _ 

[0197] Mismatched bases in a duplex are also susceptible to chemical modification. Such modification can render 
the strands susceptible to cleavage at the site of the mismatch or cause a polymerase to stop in a subsequent extension 
reaction. The chemical cleavage technique allows identification of a mutation in target sequences of up to 2 kb and It 
to provides information on the approximate location of mismatched nucleotide(s) (Cotton et al., PNAS USA 85:4397, 
1988; Ganguly et al., Nucl. Acids Res 18:3933, 1991). 

[0198] An alternative strategy for detecting a mutation in a DNA strand is by substituting (during synthesis) one of 
the normal nucleotides with a modified nucleotide, altering the molecular weight or other physical parameter of the 
product. A strand with an Increased or decreased number of this modified nucleotide relative to the wlld-type sequence 
15 exhibits altered electrophoretic mobility (Naylor et al.. Lancet 337:635. 1991). Again, this technique detects the pres- 
ence of a mutation, but does not provide the location. 

[0199] All of the atwve-mentioned techniques indicate the presence of a mutation in a limited segment of DNA and 
some of them allow approximate localization within the segment. However, sequence analysis is still required to locate 
the precise position of the base change. 

20 [0200] A large number of other techniques have been developed to analyze known sequence variants or single 
nucleotide polymorphisms. Automation and economy are very important considerations for these types of analyses 
that may be applied, for screening individuals and the general population. Mutations may be identified via their desta- 
bilizing effects on the hybridization of short oligonucleotide probes to a target sequence (see Wetmur, Cnt Rev. Bio- 
ahem. Mol. BioL, 26:227, 1991 ). Generally, this technique, allele-specific oligonucleotide hybridization involves ampli- 

25 fication of target sequences and subsequent hybridization with short oligonucleotide probes. Oligonucleotide-ligation 
assay is an extension of PCR-based screening that uses an ELISA-based assay (OLA, Nickerson et al., P/oc. Natl. 
Acad. Sci. USA 87:8923, 1990) to detect the PGR products that contain the target sequence. Thus, both gel electro- 
phoresis and colony hybridization are eliminated. 

[0201] As noted above, the CMST technology also provides a wide variety of methods where the cleavable tags and/ 

30 or linkers may be utilized in place of traditional labels (e.g., radioactive, fluorescent, or enzymatic), in order enhance 
the specificity, sensitivity, or number of samples that may be simultaneously analyzed, within a given me.thod. Repre- 
sentative examples of such methods which may be enhanced include, for example, standard nucleic acid hybridization 
reactions (see Sambrook et al., supra), diagnostic reactions such as Cycling Probe Technology (CPT) (see U.S. Patent 
Nos. 4,876.187 and 5,011,769) or Oligonucleotide-Ligation Assay (OLA) (Buricet et al,. Science t95:180, 1987). 

35 [0202] The CMST technology combined with hybridization can be applied to forensics. DNA analysis readily permits 
the deduction of retatedness between individuals such as Is required in paternity testing. Genetic analysis has proven 
highly useful in bone marrow transplantation, where it is necessary to distinguish between closely related donor and 
recipient cells. Two types of probes are now in use for DNA fingerprinting and genotyping. Polymorphic minisatellite 
DNA probes identify multiple DNA sequences, each present in variable forms in different individuals, thus generating 

40 patterns that are complex and highly variable between individuals. VNTR probes identify single sequences in the ge- 
nome, but these sequences may be present in up to 30 different forms In the human population as distinguished by 
the size of the identified fragments (Bennett and Todd. 1996, Ann. Rev. Genetics, 30, p343-70). 
[0203] Tumor diagnostics and staging (Goodfellow and Wells, 1995. J. Natl. Cancer Inst, 87, pi 51 5-23) is another 
application of the CMST technology platform. The detection of oncogenes and their respective polymorphisms is an 

<5 important field of nucleic acid diagnostics. The cellular oncogenes can be activated by specific modifications such as 
point mutations (as in the c-K-ras oncogene in bladder carcinoma and in colorectal tumors), promoter induction, gene 
amplification (as in the N-myc oncogene in the case of neuroblastoma) or the rearrangement of chromosomes (as In 
the translocation of the c-abi oncogene from chromosome 9 to chromosome 22 in the case of chronic myeloid leukemia). 
The CMST technology can also be applied to transplantation analysis, genome diagnostics (four percent of all newborns 

50 are bom with genetic defects). Of the 3,500 hereditary diseases described which are caused by the modification of 
only a single gene, the primary molecular defects are only known for at)out 400. The use of DNA probes with cleavable 
tags can be used to detect the presence or absence of micro-organisms In any type of sample or specimen. 
[0204] The CMST technology platfomi can be coupled with different sizing techniques. Capillary electrophoresis 
(CE) in its various manifestations (free solution, isotachophoresis, isoelectric focusing, polyacrylamide gel, micellar 

55 electrokinetic "chromatography") is developing as a method for rapid high resolution separations of very small sample 
volumes of complex mixtures. In combination with the inherent sensitivity and selectivity of MS, CE-MS is a potential 
powerful technique for bioanalysis. In the novel application described here, the Interfacing of these two methods could 
lead to superior DNA sequencing methods that eclipse the cun-ent rate methods of sequencing by several orders of 
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magnitude, 

[0205] The correspondence between CE and electrospray ionization (ESI) flow rates and the fact that both are fa- 
cilitated by (and primarily used for) ionic species in solution provide the basis for an extremely attractive combination. 
The combination of both capillary zone electrophoresis (CZE) and capillary isotachophoresis with quadrapole mass 
5 spectrometers based upon ESI have been described (Olivares et al., Anal, Chem. 59:1230. 1987; Smith et al., Anal. 
Chem. 60:436. 1988; Loo et al., Anal. Chem. 179:404, 1989; Edmondsjt al.. Chroma. 474:21. 1989; Loo et al.,y. 
Microcolumn Sep. t:223ri989; Lee et al. J. Chroma'tog. 458:313^ 1988; Smith et al.. J. Chromatog. 480:211, 1989;' 
Grese et al., J. Am. Chem. Soc. 1 17:2835, 1989). 

[0206] The most powerful separation method for DNA fragments is polyacrylamide gel electrophoresis (PAGE), gen- 

10 erally in a slab gel format. However, the major limitation of the current technology is the relatively long time required 
to perform the gel electrophoresis of DNA fragments produced in the sequencing reactions. An increase magnitude 
(10-fold) can be achieved with the use of capillary electrophoresis which utilize ultrathin gels. In polyacrylamide gels, 
DNA fragments sieve and migrate as a function of length and this approach has now been applied to CE. Remaricable 
plate number per meter has now been achieved with cross-linked polyacrylamide (10-^ plates per meter, Cohen et al., 

15 Proc. Natl. Acad. Sc/., USA 85:9660. 1988). Such CE columns as described can be employed for DNA sequencing. 
Smith and others (Smith et al., Nuc. Adds. Res. /8:4417. 1990) have suggested employing multiple capillaries in 
parallel to increase throughput. Likewise, Mathies and Huang (Mathies and Huang, Nature 359:167, 1992) have intro- 
duced capillary electrophoresis in which separations are performed on a parallel array of capillaries and demonstrated 
high through-put sequencing (Huang et al., Anal. Chem. 64:967. 1992, Huang et al.. Anal. Chem, 64:2149, 1992). 

20 Since there is no reason to run parallel lanes, there is no reason to use a slab gel. Therefore, one can employ a tube 
gel fonnat for the electrophoretic separation method. Grossman (Grossman et al.. Genet. Anal. Tech. AppL 9:9, 1 992) 
have shown that considerable advantage is gained when a tube gel format is used in place of a slab gel format. This 
is due to the greater ability to dissipate Joule heat in a tube format compared to a slab gel which results in faster run 
. times (by 50%), and much higher resolution of high molecular weight DNA fragments (greater than 1000 nt). Long 

25 reads are critical in genomic sequencing. Therefore, the use of cleavable tags in sequencing has the additional ad- 
vantage of allowing the user to employ the most efficient and sensitive DNA separation method which also possesses 
the highest resolution. 

[0207] The underlying concept behind the use of microfabricated devices is the ability to increase the information 
density in electrophoresis by miniaturizing the lane dimension to about 100 micrometers. The electronics industry 

30 routinely uses microfabrication to make circuits with features of less than one micron in size. The current density of 
capillary arrays is limited the outside diameter of the capillary tube. Microfabrication of channels produces a higher 
density of arrays. Microfabrication also permits physical assemblies not possible with glass fibers and links the channels 
directly to other devices on a chip. Few devices have been constructed on miaochlps for separation technologies. A 
gas chromatograph (Teny et al., IEEE Trans. Electron Device, ED-26:1880. 1979) and a liquid chromatograph (Manz 

35 et al.. Sens. Actuators 6f :249. 1990) have been fabricated on silicon chips, but these devices have not been widely 
used. Several groups have reported separating fluorescent dyes and amino acids on microfabricated devices (Manz 
etal., J. Chromatography 593:253, 1992, Effenhauseretal., /Ina/. Chem. 65:2637. 1993). Recently Wool ley and Math- 
ies (Woolley and Mathies, Proc. Natl. Acad. Sci. 9t:11348, 1994) have shown that photolithography and chemical 
etching can be used to make large numbers of separation channels on glass substrates. The channels are filled with 

<o hydroxyethyl cellulose (HEC) separation matrices. 

[0208] Hybotropes are advantageously employed In assays and other methods wherein a tagged oligonucleotide is 
hybridized to a complemenary or semi-complementary (I.e., almost, but not exactly the same sequence as the tagged 
ODN) nucleic acid fragment. Hybotropes are more fully described in, for example. U.S. Patent Application Nos. 
60/026,621 (filed September 24, 1996); 08/71 9, 132 (filed September24. 1996); 08/933.924 (filed September 23, 1997); 

45 09/002.051 (filed December 31 , 1997); and PCT Intemational Publication No. WO 98/1 3527, all of which are incorpo- 
rated herein in their entireties. 

[0209] The observation that A-T,^ does not change as a function of concentration of hybotrope has substantial utility 
for use in DNA, RNA or nucleic acid amplifications based on primer extension by a polymerase (e.g., polymerase chain 
reaction, see U.S. Patent Nos. 4.683,195; 4,683,202; and 4,800,159. cycling probe technology, NASBA), ligation (LCR, 

50 ligation chain reaction), and RNA amplification (see Lizardi et al., B/o/7ec/ino/ogy 6:1197, 1988; Kramer et al., Nature 
339:401, 1989; Lomeli et al.. Clin. Chem. 35:1826, 1989; U.S. Patent No. 3,786,600). The observation that wt (wild 
type) and mt (mutant) 30-mer oligonucleotides (30 linked nucleotides in the oligonucleotide) can be distinguished on 
the basis of thermal melting in 0.5 M LiTCA penfnits the possibility of a substantial improvement in priming efficiency 
in PGR. In its current configuration, the PGR buffer is optimized for the polymerase rather for specific priming. That is, 

55 conditions have evolved since the introduction of the technique that favor performance of the polymerase over the 
perfonnance of specificity of priming with oligonucleotides. Thus, PCR buffer as cun-ently commercially available does 
not provide or support a high level of stringency of hybridization of PCR primers. 

[021 0] Commercially available PCR buffers are examined with respect to the melting behavior of 24-mer oligonucle- 



37 



EP 0 990 047 B1 



otides in both the wild-type (wt) and mutant (mt) forms. Alternatively, priming is performed in a hybotrope solution and 
chain extension is performed in a separate buffer that supports the polymerase. For example, a solid phase PGR could 
be employed where the solid phase is moved through two solutions. Priming would occur in some appropriate con- 
centration of LITCA or TMATCA and then the polymerase chain reaction would take place in a different PGR buffer 
5 containing the polymerase. It is also possible to conduct the first few rounds in the amplification in a hybotrope based 
hybridization solution and conducting the remaining rounds on normal PGR buffer (generally, only the first few rounds 
are important for specificity). 

[0211] The use of abasic modified oligonucleotides also increases the specificity of priming in the PGR (see, e.g., 
U.S. Patent Application Nos. 60/026.621 (filed September 24. 1996); 08/719,132 (filed September 24, 1996); 

10 08/933.924 (filed September 23. 1 997); 09/002.051 (filed December 31.1 997); and PCT International Publication No. 
WO 98/13527). One abasic substitution Incorporated into an oligonucleotide reduces the HCT by 2.5'*C. Two oligonu- 
cleotides probes containing 3 abasic sites per 24-mer have a HGT deaease to S^G relative to the unsubstituted control. 
This decrease in the HCT dramatically increases the level of specificity of priming in the PGR reaction. This is due to 
the reduction of false or mis-priming during the first 1 0 cycles of the PGR. That is, the enthalpy of the abasic substituted 

15 oligonucleotide increases relative to the unsubstituted primer, thus increasing the specificity of priming. The primer is 
preferably 6 to 36 baSes in length and contains 1 to 6 abasic sites. The abasic sites are preferably separated by 4. 5. 
6, 7 or 8 nucleotides and may be separated by up to 1 2 to 24 nucleotides. The substitutions are also preferably clustered 
at the 3' end of the primer to ensure specificity of primer extension by nucleic acid polymerases. 
[021 2] Furthermore, the combination of an abasic site in a PGR primer and the use of a hybotrope salt solution which 

20 promotes a high enthalpy value for the primer duplex significantiy lowers the A-HGT of the primer duplex. As discussed 
above, when the A-HCT decreases, the stringency factor increases and high-discrimination priming of the polymerase 
chain reaction can take place. These are conditions required for multiplexing PGRs. The term multiplexing refers to 
the ability to use more than one set of primers in a PGR reaction and generate multiple products or the ability to use 
more than one target nucleic acid per set of PGR primers. The use of the hytwtrope teti-amethylammonium trichloro- 

25 acetate is of particular utility because the dependence of G-i-G content on T„^ (stability) is neufi-alized. 

[0213] The hybrotropic solutions described are used to increase the specificity of priming in the PGR. There are 
several options in terms of a mechanism in which the specificity of the priming step can be improved. The first is a 
through the use of a solid support to which one of the PGR primers is (covalentiy) attached. The solid support can take 
many forms such as beads, membranes, etc. The priming step can take place in the hylx)trope and then the solid 

30 support can be washed and moved into a solution that supports ttie polymerase chain extension. The solid support is 
then moved back Into the nesstrope for the priming reaction and the cyde is repeated. The cycling of the solid support 
between the two different solutions only has to occur to a limited number of times (1-15 cycles) after which time the 
traditional amplification cyde in a standardized PGR buffer can be allowed proceed. Alternatively, the target nucleic 
acids of interest are moved between the priming solution and the polymerase extension reaction solution using electric 

35 fields {i.e., electrophoresis). 

[0214] The use of hybotropes and/or abasic or anudeosidic oligonucleotide probes can be used increase the spe- 
cificity and efficiency of isothermal applications of polymerases to the amplification of nudeic acid sequences. Appli- 
cations of isothermal conditions for using nucleic acid polymerases indude nudeic acid sequencing, genotyping, mu- 
tation detection, oligonucleotide ligation assays, mutation detection, and the like. 

40 [0215] Another method used to enhance specificity in hybridization reactions creates base mismatches using base 
analogs to replace any of the A. G. C. or T nudeotkies. Research has shown that some primers containing a base pair 
mismatch have increased specificity when the mismatch is placed in precise locations (see Wenham et al., Clinical 
Chemistry 37:241, 1991;Newton ei al. Nucleic Acids Research 17:2503, 1989; Ishikawa et al.. Human /mmi/no/ogy 
42:315, 1995). However, differences of as littie as 0.5*^0 in the melting temperatures are equally common between 

45 perfectiy matched hybrids and the same hybrid with a single base mismatch inti-oduced (see Tibanyenda et al. European 
Journal of Biochemistry t39:1 9. 1 984; Wemtges et al. Nucleic Acids Research 14:3773, 1 986). Even better specificity 
has been noted between one and two base mismatched duplexes than has been observed between a perfectly matched 
duplex and the same duplex with a single mismatch (see Guo et al., Nature Biotechnology 15:331 , 1997). Guo et al. 
found a (Tm of 4 G between zero and one mismatches and a ATm of 1 3°G between one and two adjacent mismatches 

50 for a 20-mer duplex. However, even with two mismatches, often there is still little destabilization of the duplex. This 
inability to consistently discriminate mismatches lends to \he lack of specificity in PGR. 

[021 6] The use of more than one base pair mismatch per hybridization employing at least one nucleotide analog has 
been evaluated (see Guo et al., Nature Biotechnology 15:331, 1997). In this case, the analog compound consists of 
3-niti-opyrrole replacement of the purine or pyrimidine bases. 3-Nitropyrrole has the ability to minimally hydrogen bond 
55 with all four bases (see Nichols et al, Nature 369:492, 1994; Bergstorm et al., Journal of the American Chemical 
Society 117: 1201, 1995). By introducing an artificial mismatch, large differences in the duplex melting temperatures 
occur ranging from approximately 5°G to 15°G with the largest difference occumng when the mismatch is located at 
the center of the 1 5-mer hybridizing oligo. Significant differences in ATm occur when an artificial nudeotide is introduced 
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into a duplex that already contains a base mismatch creating a two-mismatch duplex. The degree of destabilization 
depends upon the type of base mismatch (e.g., G/T) and the separation between the two mismatches. In experimental 
examination, the base analog nucleotide ranged from 1 to 7 bases to the 3* side of the base mismatch, which was held 
in the center of the 15-mer. Differences in ATm for the three different base mismatched 15-mers ranged from a 2*^0 
5 stabilization (in the C/T mismatch case only and when the mismatches are adjacent) to a 7°C further destabilization 
with the maximum destabilization (x>nsistently occurring at a 3 or 4 base mismatch separation (see Guo et al.. Nature 
Biotechnology 15:33^, 1997), 

[0217] When two artificial mismatches are introduced, the proximity of the artificial bases greatly influences the de- 
gree of destabilization. The two artificial mismatches were centered on the middle of a 21-mer duplex beginning with 
10 a separation of 6 bp. The destabilization. or ATm, is minimally 12°C when compared to the perfectly matched duplex. 
The greatest difference of over 20°C occurs when the two artificial mismatches are 1 0 base pairs apart. This difference 
corresponds to one helical turn and indicates that some kind of interaction occurs between the two artificial bases that 
decreases the stability of the duplex. 

[0218] Experimentally, when the PCR primer utilized contained one or two artificial mismatches between the primer 

15 and the DNA sample, the PCR gave results as would be expected for a perfectly matched primer (see Guo et al., 
Nature Biotechnology f 5:331, 1997). However, when the primer contained both a true and an artificial mismatch, the 
PCR failed to produce any measurable results. While PCR with perfectly matched and true mismatches all produced 
measurable amounts of PCR product. The same study found similar results when using hybridization probes: those 
with perfect matches, true mismatches and artificial mismatches annealed while the probes containing artificial and 

20 true mismatches did not. These studies indicate greater specificity is created when artificial base mismatches are 
incorporated in hybridization reactions such that when naturally occurring mismatches occur, they are thermodynam- 
ically less stable than a perfectly matched hybridization reaction and thus less likely to produce a false positive in an 
assay or PCR. Interestingly, however, the difference in thermodynamic stability noted abowe for duplexes containing 
only artificial mismatches is not manifested in the experimental situation. 

25 [0219] A further means of effecting hybridization discrimination is through differences in the stability between hybrid- 
ization duplexes that contain nicks and gaps. In these reactions, duplexes are formed from tandemly stacked short 
oligomers hybridized to a longer strand that either align contiguously or non-contiguously leaving a few base pair gap. 
Hybridizations that result in a nick are subject to istacking hybridization? where another DNA strand hybridizes across 
the nick site. Stacking hybridization does not occur where gaps are present in the non-contiguous oligomers. The 

30 stacking has the effect of increased discrimination as evidenced by decreased dissociation rates and greater thermo- 
dynamic stability than the non-contiguous counterparts (see Lane et al. Nucleic Acids Res. 25:61 1 , 1 997). Thermody- 
namic measurements show differences between the hybridization stacked duplexes standard free energy change (AG) 
and the gapped duplexes is 1 .4 to 2.4 kcal/mol. Therefore, discrimination in hybridization can be afforded through the 
use of multiple short probes. 

35 [0220] Most of the base mimics in current use are the result of the pursuit for a universal base. Many utilize nitroazole 
base analogues and have demonstrated reduced discrimination in base pairing. A series of nitroazole nucleobase 
analogues have been studied in attempts to gain additional insight Into the significance of electronic structure and 
heterocyclic size in base pairing for the development of more effective universal bases (see Bergstrom et al. Nucleic 
Acids Res. 25:1935, 1997). In this wort^. the thermodynamic properties of the deoxyribonucleosides of 3-nitropyrrole, 

40 4-nitropyrazole, 4-nitroimidazole, and 5-nitroindole were measured. For comparison, thermodynamic measurements 
were also made on the deoxyribonucleosides of hypoxanthine and pyrazole as well an abasic spacer. 1 ,2-dideoxyri- 
bose. Four oligonucleotides were synthesized for each modified nucleoside in order to obtain duplexes in which each 
of the four natural bases was placed opposite the base mimic. All of the base mimics analyzed proved to be far less 
stable than the natural base pairings (A+T: Tm = 65.7^C, C+G: Tm = 70.5°C) with the Tmis ranging from 35-46*'C for 

45 5-nitroindole to 1 8-29°C for the other nitroazole bases analyzed. The only exception was 4-nitroimidazole paired with 
dGTP where the Tm was 40,9**C..ln analyzing the free energy for the duplex melting, the 3-nitropyrrole base mimic 
was found to have the least discrimination when pairing with any of the four naturally occurring bases with an overall 
AG of 0.4kcal/mol. The next least discriminating was 5-nitroindole with a (AG of 0.8 kcal/mol. Both of these values are 
less than the (AG of 1,1 kcal/mol found between the natural base pairings of A+T and G+C. 4-Nitropyrazole showed 

50 a slight preference for pairing with A with a AG ( 1 kcal/mol more stable than C, G, and T free energies. Finally, 4-ni- 
troimidazole showed a high selectivity for pairing to G (as was evidenced by its high Tm value) due to the ability of the 
imidazole N3 to hydrogen bond with the deoxyguanosine N1 . It should be noted, however, that the above values are 
dependent upon the nearest base neighbors to the mimic. Further studies altered the nearest neighbors and found 
that 3-nitropyrrole and 5-nitroindole are quite non-discriminating base pairing partners. 

55 [0221] Of interest, the enthalpy and entropy changes were found to track one another (i.e. a large enthalpy change 
correlates to a large entropy change) regardless of the base mimic utilized implying that the correlation between AS 
and AH is independent of the mode of association of the bases. What was observed was that small enthalpy and 
entropy changes were found in the non-hydrogen bonding base mimics. The low values for entropy change reflect the 



39 



EP 0 990 047 B1 



greater degree of freedom of movement possible for bases that are not locked into the duplex by hydrogen bonding 
interactions. The small enthalpy changes reflect alterations in hydrogen bonding interactions as a result of the loss of 
hydrogen bonding interactions for the base opposite the base mimic. If a natural base remains stacked in the helix 
without an opposing hydrogen bonding partner then It has lost hydrogen bonding interactions with water without re- 

5 gaining a new donor/acceptor partner. 

[0222] A similar study involved examining acyclic nucleoside analogues with carboxamido- or nitro-substituted het- 
" erocyclic bases (see Aerschot et al. Nucleic Acids'Res.'23'A363, 1995): Utilization of acyclic nudeosides endows the 
constructs with enough flexibility to allow good base stacking as well as allow the base mimics to obtain an orientatfon 
to best base-pair with the corresponding base. The heterocyclic bases examined included: 4,5-imidazoledicartx)xam- 

10 ide, 4-nitroimidazole, and 5-nitroindazoie. These complexes were referenced against acyclic hypoxanthine, 1-(2(-de- 
oxy-(-D-ribofuranosyl)-3-nitropyrrole, 5-nitroindole, and 2(-deoxyinosine. All the new acyclic complexes had melting 
temperatures 7-20^C less than those observed for the natural bases, 5-Nitroindazole when paired against each of the 
four natural bases had the least spread in (Tm of only 2,2X while the 4-nitroimidazole had a spread of 8.0°C with dG 
being significantly out of line with the other three bases as had similarly been observed above. Of the reference com- 

15 pounds, deoxyinosine had a ATm of 5.6°C, 5-nilroindoles ATm was 1 .C^C. 1 -(2(-deoxy-(-D-ribofuranosyl)-3-nitropyrrole 
had a ATm of 5.1 •'C. and the ATm of acyclic hypoxanthine was 4.8**C. However, all base mimics showed about the 
same destabilization (ATm 4-5**C) when placed in an oligo consisting almost exclusively of adenosines with exception 
of 4-nitroimidazole and acyclic deoxyinosine that had ATms of 7.0°C and 8.9°C, respectively. 
[0223] Aerschot and co-workers also examined the effect of incorporation of multiple base mimics into an oligo (see 

20 Aerschot et al. Nucleic Acids Res 23:4363. 1995), Overall, melting temperatures dropped but most markedly with the 
incorporation of three base mimics. The nitroindoles. however, showed the least amount of temperature differential. 
[0224] Another base mimic, 1-(2(-deoxy-(-D-ribofuranosyl) imidazole-4-carboxamide (Nucleoside 1 ). mimics prefer- 
entially dA as well as dC nucleosides (see Johnson et al. Nucleic Acids Res. 25:559, 1997). The ability to substitute 
for both dA and dC results from rotation about the carboxamide/imidazole bond as well as the bond between the 

25 imidazole and furanose ring. When the imidazole is anti to the furanose and the carboxamide group is anti to the 
imidazole, the lone pair on the oxygen and one of the amide NH hydrogens is in a position that mimics the NH2 and 
N-1 of adenosine. Imidazole rotation about the glycosidic bond to the syn orientation places the amide group in a 
positton that approximately matches the positions of the NH2 and N-3 of cytosine. 

[0225] When Nucleoside 1 is substituted for any naturally occunring nucleoside, the enthalpy increases with the 

30 greatest increase for a dG substitution for the 1-C pairing (from AH = 74.7 (kcal/mol)/ AG = -16.5 (kcal/mol) for the G/ 
C pairing to AH = -45.5 (kcal/mol)/ AG = -5.8 (kcal/mot)). The smallest enthalpy change occurs for a dA substitution 
(AH = -72.9 (kcal/mol)/ AG = -1 5.4 (kcal/mol) for A/T pairing to AH = - 66.7 (kcal/mol)/ AG = -1 1 .7 (kcal/mol) for the 1 -T 
pairing). Con^espondingly, Tm significantly decreases from 65.7°C and 70.5**C for the A-T and C-G couples, respec- 
tively, to 46.6^C for the 1-T pairing, 43.4°C for 1-G, 27.6°C for 1-A, and 14.6°C for 1-C. 

35 [0226] When used in a PGR reaction. Nucleoside 1 and its N-propyl derivative are preferentially incorporated as 
dATP analogues (see Sala et al. Nucleic Acids Res. 24:3302. 1996). However, once incorporated into a DNA template, 
their ambiguous hydrogen bonding potential gave rise to misincorporation of any of the naturally occumng bases at 
frequencies of (3x10-2 per base per amplification. Most of the substitutions (primarily consisting of G) were a result 
of rotation about the carboxamide bond when part of the template. Between 11-1 5% of the substitutions were due to 

40 rotation of the imidazole moiety about the glycosidic bond. As part of a DNA template, the N-propyl derivative behaved 
in the same way as 1 despite Its propyl moiety. This study indicates that while 1 preferentially behaves as dATP, it has 
the ability in a PGR type environment to behave as all four naturally occurring nucleotides as well. From this and the 
above studies, it is evident that a wide range of duplex stability can be obtained through variations in base mimics and 
their placement within an oligonucleotide. 

45 [0227] Within one aspect of the present invention, methods are provided for determining the identity of a nucleic acid 
molecule or fragment (or for detecting the presence of a selected nucleic acid molecule or fragment), comprising the 
steps of (a) generating tagged nucleic acid molecules from one or more selected target nucleic acid molecules, wherein 
a tag is con-elative with a particular nucleic acid molecule and detectable by non-fluorescent spectrometry or potenti- 
ometry, (b) separating the tagged molecules by size, (c) cleaving the tags from the tagged molecules, and (d) detecting 

50 the tags by non-fluorescent spectrometry or potentiometry, and therefrom determining the kJentity of the nucleic acid 
molecules. 

[0228] Within a related aspect of the invention, methods are provided for detecting a selected nucleic acid molecule, 
comprising the steps of (a) combining tagged nucleic acid probes with target nucleic acid molecules under conditions 
and for a time sufficient to permit hybridization of a tagged nucleic acid probe to a complementary selected target 
55 nucleic acid sequence, wherein a tagged nucleic acid probe is detectable by non-fluroescent spectrometry or potenti- 
ometry, (b) altering the size of hybridized tagged probes, unhybridized probes or target molecules, or the probe:larget 
hybrids, (c) separating the tagged probes by size, (d) cleaving tags from the tagged probes, and (e) detecting tags by 
non-fluorescent spectrometry or potentiometry, and therefrom detecting the selected nucleic acid molecule. These, 
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other related techniques are discussed in more detail below. 
2. PGR 

5 [0229] PGR can amplify a desired DNA sequence of any origin (virus, bacteria, plant, or human) hundreds of millions 
of times in a matter of hours. PGR is especially valuable because the reaction is highly specific, easily automated, and 
capable of amplifying minute amounts of sample. For these reasons, PGR has had a major impact on clinical medicine, " 
genetic disease diagnostics, forensic science and evolutionary biology. 

[0230] Briefly. PGR is a process based on a specialized polymerase, which can synthesize a complementary strand 
10 to a given DNA strand in a mixture containing the 4 DNA bases and 2 DNA fragments (primers, each about 20 bases 
long) flanking the target sequence. The mixture is heated to separate the strands of double- stranded DNA containing 
the target sequence and then cooled to allow (I) the primers to find and bind to their complementary sequences on the 
separated strands and (2) the polymerase to extend the primers into new complementary strands. Repeated heating 
and cooling cycles multiply the target DNA exponentially, since each new double strand separates to become two 
15 templates for further synthesis. In about 1 hour, 20 PGR cycles can amplify the target by a millionfold. 

[0231] Within one embodiment of the invention, methods are provided for determining the Identity of a nucleic add 
molecule, or for detecting the selected nucleic acid molecule in. for example, a biological sample, utilizing the technique 
of PGR. Briefly, such methods comprise the steps of generating a series of tagged nucleic acid fragments or molecules 
during the PGR and separating the resulting fragments are by size. The size separation step can be accomplished 
20 utilizing any of the techniques described herein, including for example gel electrophoresis (e.g., polyacrylamide gel 
electrophoresis) or preferably HPLG. The tags are then deaved from the separated fragments and detected by the 
respective detection technology. Examples of such technologies have been described herein, and Indude for example 
mass spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spectrometry. 

25 3. RNA Fingerprinting and Differential Display 

[0232] When the template is RNA, the first step in fingerprinting is reverse transcription. Liang and Pardee {Science 
257:967, 1 992) were the first to describe an RNA fingerprinting protocol, using a primer for reverse transcription based 
on oligo (dT) but with an 'anchor* of two bases at the 5* end (e.g., oligo 5'-{dT^i)GA-3'. Priming occurs mainly at the 5' 
30 end of the poly(rA) tail and mainly in sequences that end 5'-UpG-poly(rA)-3', with a selectivity approaching one out of 
12 polyadenylated RNAs. After reverse transcription and denaturation, arbitrary priming is performed on the resulting 
first strand of cDNA. PGR can now be used to generate a fingerprint of products that best matches the primers and 
that are derived from the 3' end of the mRNAs and polyadenylated heterogeneous RNAs, This protocol has been 
named 'differential display*. 

35 [0233] Altematively, an arbitrary primer can be used in the first step of reverse transcription, selecting those regions 
intemal to the RNA that have 6-8 base matches with the 3* end of the primer. This is followed by artjitrary priming of 
the resulting first strand of cDNA with the same or a different arbitrary primer and then PGR. This particular protocol 
samples anywhere in the RNA, including open reading frames (Welsh et ah. Nuc. Acids. Res. 20:4965, 1992). In 
addition, it can be used on RNAs that are not polyadenylated, such as many bacterial RNAs. This variant of RNA 

40 fingerprinting by arbitrarily primed PGR has been called f^P-PGR. 

[0234] If arbitrarily primed PGR fingerprinting of RNA is performed on samples derived from cells, tissues or other 
biological material that have been subjected to different experimental treatments or have different developmental his> 
tones, differences in gene expression between the samples can be detected. For each reaction, it is assumed that the 
same number of effective PGR doubling events occur and any differences in the initial concentrations of cDNA products 

45 are preserved as a ratio of intensities in the final fingerprint. There are no meaningful relationships between the inten- 
sities of bands within a single lane on a gel, which are a function of match and abundance. However, the ratio between 
lanes is preserved for each sampled RNA, allowing differentially expressed RNAs to be detected. The ratio of starting 
materials between samples Is maintained even when the number of cydes is sufficient to allow the PGR reaction to 
saturate. This is because the number of doublings needed to reach saturation are almost completely controlled by the 

50 invariant products that make up the majority of the fingerprint. In this regard, PGR fingerprinting is different from con- 
ventional PGR of a single product in which the ratio of starting materials between samples is not preserved unless 
products are sampled in the exponential phase of amplification. 

[0235] Within one embodiment of the invention methods are provided for determining the identity of a nucleic add 
molecule, or for detecting a selecting nudeic acid molecule, in, for example a biological sample, utilizing the technique 
55 of RNA fingerprinting. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid 
fragments. The fragments generated by PGR or similar amplification schemes and are then subsequently separated 
by size. The size separation step can be, for example, any of the techniques described herein, induding for example 
gel electrophoresis (e.g.. polyacrylamide gel electrophoresis) or preferably HPLC. The tags are then deaved from the 
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separated fragments, and then the tags are detected by the respective detection technology. Representative examples 

of suitable technologies include mass spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spec- 
trometry. The relative quantities of any given nucleic acid fragments are not important, but the size of the band is 
informative when referenced to a control sample. 

5 

4. Fluorescence-Based PGR Single-Strand Conformation Polymorphism (PCR-^SCP) 



[0236] A number of methods in addition to the RFLP approach are available for analyzing base substitution poly- 
morphisms. Orita, a/, have devised a way of analyzing these polymorphisms on the basis of conformational differ- 
to ences in denatured DNA. Briefly, restriction enzyme digestion or PGR is used to produce relatively small DNA fragments 
which are then denatured and resolved by electrophoresis on non-denaturing polyacrylamide gels. Conformational 
differences in the single-stranded DNA fragments resulting from base substitutions are detected by electrophoretic 
mobility shifts. Intra-strand base pairing creates single strand conformations that are highly sequence-specific and 
distinctive in electrophoretic mobility. However, detection rates in different studies using conventional SSCP range from 
15 35% to neariy 100% with the highest detection rates most often requiring several different conditions. In principle, the 
method could also be used to analyze polymorphisms based on short insertions or deletions. This method is one of 
the most powerful tools for identifying point mutations and deletions In DNA (SSCP-PCR, Dean et al.. Cell 6f :863, 
1990). 

(0237] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
20 molecule, or for detecting a selecting nucleic acid molecule, in. for example a biological sample, utilizing the technique 
of PCR-SSR Briefly, such methods generally comprise the steps of generating a series of tagged nucleic add frag- 
ments. The fi^agments generated by PGR are then separated by size. Preferably, the size separation step is non- 
denaturing and the nucleic acid fragments are denatured prior to the separation methodology. The size separation step 
can be accomplished, for example gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or preferably HPLG. 
25 The tags are then cleaved from the separated fragments, and then the tags are detected by the respective detection 
technology (e.g., mass spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spectrometry). 

5. Dideoxy Fingerprinting (ddP) 

30 [0238] Another method has been described (ddF, Sarkar et al., Genomics 13:441, 1992) that detected 100% of 
single-base changes Iri the human factor IX gene when tested in a retrospective and prospective manner. In total, 84 
of 84 different sequence changes were detected when genomic DNA was analyzed from patients with hemophilia B. 
[0239] Briefly, in the applications of tags for genotyping or other purposes, one method that can be used Is dideoxy- 
fingerprinting. This method utilizes a dideoxy terminator in a Sanger sequencing reation. The principle of the method 

35 is as follows: a target nucleic acid that is to be sequenced is placed in a reaction which possesses a dideoxy-terminator 
complementary to the base known to be mutated in the target nucleic acid. For example, if the mutation results in a 
A->G change, the reaction would be carried out in a C dideoxy-terminator reaction. PGR primers are used to locate 
and amplify the target sequence of interest. If the hypothetical target sequence contains the A->G change, the size of 
a population of sequences is changed due to the incorporation of a dideoxy-terminator in the amplified sequences. In 

40 this particular application of tags, a fragment would be generated which would possess a predictable size in the case 
of a mutation. The tags would be attached to the 5*-end of the PGR primers and provide a "map" to sample type and 
dideoxy-terminator type. A PGR amplification reaction would take place, the resulting fragments would be separated 
by size by for example HPLG or PAGE. At the end of the separation procedure, the DNA fragments are collected in a 
temporal reference frame, the tags are cleaved and the presence or absence of mutation is determined by the chain 

45 length due to premature chain tenminator by the incorporation of a given dideoxy-tenninator. 

[0240] It is important to note that ddf results in the gain or loss of a dideoxy-termination segment and or a shift in the 
mobility of at least one of the termination segments or products. Therefore, in this method, a search is made of the 
shift of one fragment mobility in a high background of other molecular weight fragments. One advantage is the fore- 
knowledge of the length of fragment associated with a given mutation. 

50 [0241] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
of ddF. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid fragments, 
followed by separation based upon size. Preferably, the size separation step Is non-denaturing and the nucleic acid 
fragments are denatured prior to the separation methodology. The size separation step can t>e accomplished, for 

55 example gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or preferably HPLG. The tags are then cleaved 
from the separated fragments, and then the tags are detected by the respective detection technology (e.g., mass 
spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spectrometry). 



42 



EP 0 990 047 B1 



6. Restriction Maps and RFLPs 

[02421 Restriction endonucleases recognize stiort DNA sequences and cut DNA molecules at those specific sites. 
Some restriction enzymes (rare-cutters) cut DNA very infrequently, generating a small number of very large fragments 

5 (several thousand to a million bp). Most enzymes cut DNA more frequently, thus generating a large number of small 
_ fragments (less. than a hundredjo more than a thousarid bp). On average, restriction enzymes with 4-base recognition 
sites will yield pieces 256 bases long, 6-base recognition sites will yield pieces 4000 bases long, and 8-base recognition 
sites will yield pieces 64,000 bases long. Since hundreds of different restriction enzymes have been characterized, 
DNA can be cut into many different small firagments. 

10 [0243] A wide variety of techniques have been developed for the analysis of DNA polymorphisms. The most widely 
used method, tiie restriction fragment length polymorphism (RFPL) approach, combines restriction enzyme digestion, 
gel electrophoresis, blotting to a membrane and hybridization to a cloned DNA probe. Polymorphisms are detected as 
variations in the lengths of the labeled fragments on tiie blots. The RFLP approach can be used to analyze base 
substitutions when the sequence change falls witiiin a restriction enzyme site or to analyze minisatellitesA/NTRs by 

15 choosing restriction enzymes that cut outside the repeat units. The agarose gels do not usually afford the resolution 
necessary to distinguish minisatellite/VNTR alleles differing by a single repeat unit, but many of the minlsatellites/ 
VNTRs are so variable that highly informative markers can still be obtained. 

[0244] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic add molecule, in, for example a biological sample, utilizing the technique 

20 of restriction mapping or RFLPs. Briefly, such methods generally comprise the steps of generating a series of tagged 
nucleic acid fragments in which the fragments generated are digested with restriction enzymes. The tagged fragments 
are generated by conducting a hybridization step of the tagged probes with the digested target nucleic acid. The hy- 
bridization step can take place prior to or after the resti-iction nuclease digestion. The resulting digested nucleic acid 
fragments are tiien separated by size. The size separation step can be accomplished, for example gel electi-ophoresis 

25 (e.g. , polyacf7lamide gel electrophoresis) or preferably HPLC. The tags are then cleaved from the separated fragments, 
and then the tags are detected by the respective detection technology (e.g., mass spectrometry, infra-red spectrometry, 
potentiostatic amperometry or UV spectrometry). 

7. DNA Fingerprinting 

30 

[0245] DNA fingerprinting involves the display of a set of DNA fragments from a specific DNA sample. A variety of 
DNA fingerprinting techniques are presently available (Jeffreys et al., Nature 374:67-73, 1 985; Zabeau and Vos, 1992); 
"Selective Restriction Fragment Amplification: A General Metiiod for DNA Fingerprinting," European Patent Application 
92402629.7.; Vos et al., "DNA FINGERPRINTING: A New Technique for DNA Fingerprinting." NucL Acids Res. 23: 
35 4407-4414. 1996; Bates. S.R.E., Knorr. DA., Weller, J.W., and Ziegle, J.S.. "Instrumentation for Automated Molecular 
Marker Acquisition and Analysis." Chapter 14, pp. 239-255, in The impact of Plant Molecular Genetics, edited by B. 
W.S. Sobral, published by Birkhauser. 1996. 

[0246] Thus, one embodiment of the invention methods are provkled for determining tiie identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 

40 of DNA fingerprinting. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid 
fragments, followed by separation of the fragments by size. The size separation step can be accomplished, for example 
gel electrophoresis (e.g.. polyacrytamide gel electrophoresis) or preferably HPLC. The tags are tiien cleaved from the 
separated fragments, and then the tags are detected by ttie respective detection technology (e.g., mass spectrometry, 
infra-red spectrometry, potentiostatic amperometry or UV spectrometry). 

45 [0247] Briefly, DNA fingerprinting is based on the selective PGR amplification of restriction fragments from a total 
digest of genomic DNA. The technique involves three steps: 1 ) restriction of the DNA fragments and subsequent ligation 
of oligonucleotide adaptors. 2) selective amplification of sets of restriction fragments. 3) get analysis of the amplified 
fragments. PGR amplification of the restriction fragments is achieved by using the adaptor and restriction site sequence 
as target sites for primer annealing. The selective amplification is achieved by the use of primers that extend into the 

50 restriction fragments, amplifying only those fragments in which the primer extensions match the nucleotides flanking 
the restriction sites. 

[0248] This method therefore yields sets of restriction fragments which may be visualized by a variety of methods 
(ie., PAGE, HPLC, or other types of spectrometiy) without prior knowledge of the nucleotide sequence. The method 
also allows the co-amplification of large numbers of restriction fragments. The number of fragments however is de- 
55 pendent on the resolution of ttie detection system. Typically, 50-100 resti-iction fragments are amplified and detected 
on denaturing polyacrylamide gels. In the application described herein, ttie separation will be performed by HPLC. 
[0249] The DNA fingerprinting technique is based on the amplification of subsets of genomic resti-iction fragments 
using PGR. DNA is cut with restiiction enzymes and double strand adapters and tiie are ligated to the ends of the DNA 
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fragments to generate template DNA for the amplification reactions. The sequence of the ligated adapters and the 
adjacent restriction enzymes (sites) serve as binding sites for the DNA fingerprinting of primers for PCR-based ampli- 
fication. Selective nucleotides are included at the 3' end of the of the PGR primers which therefore can only prime DNA 
synthesis from a subset of the restriction sites . Only restriction fragments in which the nucleotides flanking the restriction 

5 site can match the selective nucleotide will be amplified. 

[0250] The DNA fingerprinting process produces "fingerprint" patterns of different fragment lengths that are charac- 
~ ' teristic and reproducible for~an individiJal organism. These fingerprints can be use to distinguish even very dosely " 
related organisms, including near-isogenic lines. The differences in fragment lengths can be traced to base changes 
in the restriction site or the primer extension site, or to insertions or deletions in the body of the DNA fragment. 

10 [0251] Dependence on sequence knowledge of the target genome is eliminated by the use of adaptors of known 
sequence that are ligated to the restriction firagments. The PGR primers are specific for the known sequences of the 
adaptors and restriction sites. The steps of the genetic fingerprinting process are described below. 

1 ) Restriction and Ligation. Restriction fragments of genomic DNA are generated by using two different restriction 
15 enzymes: a rare cutter (the six-base recognition enzyme EcoRI) and a frequent cutter (the four-base recognition 

enzyme Msel). Three different types of fragments are produced: ones with EcoRI cuts at both ends, ones with 
Msel cuts at both ends, and ones with an EcoRI cut at one end and an Msel cut at the other end. Double-stranded 
adaptors are then ligated to the sticky ends of the DNA ft-agments. generating template DNA for amplification. The 
adaptors are specific for either the EcoRI site or the Msel site. Restriction and ligation take place in a single reaction. 
20 Ligation of the adaptor to the restricted DNA alters the restriction site so as to prevent a second restriction from 

taking place after ligation has occurred. 

2) Preselectlve Amplification. The sequences of the adaptors and restriction sites serve as primer binding sites 
for the "preselective PGR amplification." The preselective primers each have a "selective" nucleotide that will rec- 
ognize the subset of restriction fragments having the matching nucleotide downstream from the restriction site. 

25 The primary products of the preselective PGR are those ft*agments having one Msel cut and one EcoRI cut, and 

also having the matching internal nucleotide. The preselective amplification achieves a 16-fold reduction of the 
complexity of the fi'agment mixture. 

3) Selective Amplification with CMST-Labeled Primers. The complexity of the PGR product mixture is further re- 
duced (256-fold) and fragments are labeled with a set of GMSTs by carrying out a second PGR using selective 

30 primers labeled with GMSTs. It is possible to choose from among 64 different primer pairs (resulting from all possible 

combinations of eight Msel and eight EcoRI primers) for this amplification. Each of these primers possesses three 
selective nucleotides. The first is the same as that used in the pre-selective amplification; the others can be any 
of the 1 6 possible combinations of the four nucleotides. Only tiiat subset of firagments having matching nucleotides 
at all three positions will be amplified at this stage in the amplification. 

35 

8. Application of Gleisivable Tags to Genotyping and Polymorphism Detection 
a. Introduction 

40 [0252] Although a few known human DNA polymorphisms are based upon insertions, deletions or other rearrange- 
ments of non-repeated sequences, the vast majority are based either upon single base substitutions or upon variations 
In the number of tandem repeats. Base substitutions are very abundant in the human genome, occumng on average 
once every 200-500 bp. Length variations in blocks of tandem repeats are also common in the genome, with at least 
tens of thousands of interspersed polymorphic sites (termed loci). Repeat lengths for tandem repeat polymorphisms 

45 range from 1 bp in (dA)n(dT)n sequences to at least 1 70 bp in a-satellite DNA. Tandem repeat polymorphisms can be 
divided into two major groups which consist of minisatellites/variable number of tandem repeats (VNTRs), with typical 
repeat lengths of tens of base pairs and with tens to thousands of total repeat units, and microsateilites, with repeat 
lengttis of up to 6 bp and with maximum total lengths of about 70 bp. Most of the microsatelitte polymorphisms identified 
to date have been based on (dG-dA)^ or (dG-dT)^ dinudeotide repeat sequences. Analysis of microsatellite polymor- 

50 phisms involves amplification by the polymerase chain reaction (PGR) of a small fragment of DNA containing a block 
of repeats followed by electrophoresis of the amplified DNA on denaturing polyacrylamide gel. The PGR primers are 
complementary to unique sequences that flank the blocks of repeats. Polyacrylamide gels, rather than agarose gels, 
are traditionally used for microsateilites because the alleles often only differ in size by a single repeat. 
[0253] Thus, within one aspect of the present invention methods are provkled for genotyping a selected organism, 

55 comprising the steps of (a) generating tagged nucleic acid molecules from a selected target molecule, wherein a tag 
is con-elative witii a particular fragment and may be detected by non-fluorescent spectrometry or potentiometry, (b) 
separating the tagged molecules by sequential length, (c) cleaving the tag from the tagged molecule, and (d) detecting 
the tag by non-fluorescent spectrometry or potentiometry. and therefrom determining tiie genotype of the organism. 
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[0254] Within another aspect, methods are provided for genotyping a selected organism, comprising the steps of (a) 
combining a tagged nucleic acid molecule with a selected target molecule under conditions and for a time sufficient to 
permit hybridization of the tagged molecule to the target molecule, wherein a tag is correlative with a particular fragment 
and may be detected by non-fluorescent spectrometry or potentiometry. (b) separating the tagged firagments by se- 
5 quential length, (c) cleaving the tag from the tagged fragment, and (d) detecting the tag by non-fluorescent spectrometry 
or potentiometry, and therefrom determining the genotype of the organism. 

b. Application ofcleavable tags to genotyping. 

10 [0255] A PGR approach to identify restriction fragment length polymorphism (RFPL) combines gel electrophoresis 
and detection of tags assoicated with specific PGR primers. In general, one PGR primer will possess one specific tag. 
The tag will therefore represent one set of PGR primers and therefore a pre-determined DNA fragment length. Poly- 
morphisms are detected as variations in the lengths of the labeled fragments in a gel or eluting from a geL Polyacry- 
lamide gel electrophoresis will usually afford the resolution necessary to distinguish minisatelliteA/NTR alleles differing 

15 by a single repeat unit. Analysis of microsatellite polymorphisms involves amplification by the polymerase chain reaction 
(PGR) of a small fragment of DNA containing a block of repeats followed by electrophoresis of the amplified DNA on 
denaturing polyacrylamide get or followed by separation of DNA fragments by HPLG. The amplified DNA will be labeled 
using primers that have cleavable tags at the 5' end of the primer. The primers are incorporated into the newly synthe- 
sized strands by chain extension. The PGR primers are complementary to unique sequences that flank the blocks of 

20 repeats. Minisatellite/VNTR polymorphisms can also be amplified, much as with the microsatellites described above. 
[0256] Descriptions of many types of DNA sequence polymorphisms have provided the fundamental basis for the 
understanding of the structure of the human genome (Botstein et al., Am. J. Human Genetics 32:p314. 1980; Donis- 
Keller, CeH51:3^9, 1987; Weissenbach et al., Nature 359:794). The construction of extensive framework linkage maps 
has been facilitated by the use of these DNA polymorphisms and has provided a practical means for localization of 

25 disease genes by linkage. Microsatellite dinudeotide markers are proving to be very powerful tools in the identification 
of human genes which have been shown to contain mutations and in some instances cause disease. Genomic dinu- 
deotide repeats are highly polymorphic (Weber, 1990, Genomic Analysis, Vol 1. pp 159-181, Gold Spring Laboratory 
Press, Gold Spring Harbor, NY; Weber and Wong, 1993, Hum. Mol. Genetics, 2, p1123) and may possess up to 24 
alleles. Microsatellite dinudeotide repeats can be amplified using primers complementary to the unique regions sur- 

30 rounding the dinudeotkie repeat by PGR. Following amplification, several amplified loci and be combined (multiplexed) 
prior to a size separation step. The process of applying the amplified microsatellite fragments to a size separation step 
and then identifying the size and therefore the allele is known as genotyping. Ghromosome specific markers which 
permit a high level of multiplexing have been reported for performing whole genome scans for linkage analysis (Davies 
etal., 1994. Nature, 371, p130). 

35 [0257] Tags can be used to great effect in genotyping with microsatellites. Briefly, the PGR primers are constructed 
to carry tags and used in a carefully chosen PG reaction to amplify di-. tri-, or tetra- nucleotide repeats. The amplification 
products are then separated according to size by methods such as HPLG or PAGE. The DNA fragments are then 
colleded in a temporal fashion, the tags cleaved from their respective DNA fragments and length deduced from com- 
parison to intemal standards in the size separation step. Allele identification is made from reference to size of the 

40 amplified products. 

[0258] With deavable tags approach to genotyping, it is possible to combine multiple samples on a single separation 
step. There are two general ways In which this can perfonned. The first general method for high through-put screening 
is the detedion of a single polymorphism In a large group of Individuals. In this senario a single or nested set of PGR 
primers is used and each amplification is done with one DNA sample type per reaction. The number of samples that 

45 can be combined in the separation step is proportional to the number of cleavable tags that can be generated per 
detection technology {i.e., 400-600 for mass spedrometer tags). It is therefore possible to identify 1 to several poly- 
morphisms in a large group of individuals simultaneously. The second approach is to use multiple sets of PGR primers 
which can identify numerous polymorphisms on a single DNA sample (genotyping an individual for example). In this 
approach PGR primers are combined in a single amplification reaction which generate PGR products of different length. 

50 Each primer pair or nested set is encoded by a specific cleavable Tag which implies each PGR fragment will be encoded 
witha specific tag. The reaction is run on a single separation step (see below). The number of samples that can be 
combined in the separation step is proportional to the number of deavable tags that can be generated per detection 
technology (i,e., 400-600 for mass spectrometer tags), 

[0259] Genotyping may also be applied to agricultural samples. For example. Amplified Polymorphism Length Pol- 
55 ymorphism (AFLP) analysis allows for agronomically meaningful grouping of the germplasm. The summary of QTL 
effects can reveal several regions of the genome that are consistently important detenninants of multiple quantitative 
traits. The association of AFLP and QTL polymorphism should be of predictive utility in designing matings. Furthermore, 
knowledge regarding the genomic architedure of related and unrelated germplasm at key regions of the genome should 
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allow for a systematic dissection of the basis of selection responses in related germplasm. It should also facilitate the 

systematic introgression of exotic germplasm. Ultimately, a comprehensive description of germplasm at the molecular 
level, coupled with extensive QTL information, should allow breeders- to simultaneously maintain mean performance 
and achieve genetic diversity. 

5 [0260] Once tools are available for the routine characterization and use of genetic resources, there will be greater 
impetus for conservation of genetic resources. 

[0261] Lander and Botslein (Lander, E.S., and D. Botst'ein. "Mapping Mendeliah factors undertying quantitative traits 
using RFLP linkage maps" Genef/cs 121:185-189. 1987, reviewed the evolution of thought leading to the reconciliation 
of the Mendelian theory of particulate inheritance and the observation that many traits exhibit continuous variation. 

10 The development of molecular marker technologies will allow for the development of sufficiently dense maps to initiate 
the dissection of quantitative traits. A major impetus for QTL detection is to manipulate the underlying determinants in 
an applied breeding context. Paterson et at. "DNA markers in plant improvement" Adv. in Agron. 46:39-90, 1991; and 
Dudley, J.W. "Comparison of genetic distance estimators using molecular marker data", Proc. Second Plant Breeding 
Symposium of the Crop Sci. Soc. Amer. and Amer. Soc. Hort., Corvallis, OR, Amer. Soc. Hort. Sci. Alexandria, VA, 

15 1994, each provided excellent overviews of the potential applications of these techniques to breeding. For example, 
association of quantitative trait expression with mapped markers can serve as the basis for making decisions regarding 
manipulation of breeding populations, for molecular marker assisted selection (MMAS). or for discovery of desirable 
genes in exotic germplasm (see also Tanksley et al. "Advanced backcross QTL analysis: a method for the simultaneous 
discovery and transfer of valuable QTLs from unadapted germplasm into elite breeding lines", Theor. Appi Genet 92: 

20 191-203, 1996). QTL for a range of traits, e.g.. yield, malting quality, winterhardiness, and disease resistance, have 
been located in a number of bariey germplasm sources (reviewed by Hayes et al "Barley genome mapping and its 
applicatuns" in P.P. Jauhar (ed.) Methods of Genome Analysis in Plants, ORG Press, Boca Raton, USA, 1996; and 
Hayes. P.M. et al. "Multiple disease resistance loci and their relationship to agronomic and quality loci in a spring barley 
population" JQTL, 1 996, see http://probe.nalusda-gov:8000/otherdocs/jqtl/ index.htm). These reports, like those in oth- 

25 er crops, have been largely descriptive. 

[0262] The genetic bases of these QTL, while of great theoretical interest and significant practical importance, are 
generally not known. It has been proposed that QTL are the consequence of alleles at loci where the effects of other 
alleles can beinterpreted in a simple Mendelian fashion. If this were the case, then QTL alleles should behave in an 
additive manner and they should be transferable from one set of germplasm to another. Alternatively, complex pheno- 

30 types may be the end results of complex pathways (see, e.g., Dawkins, R. "The extended phenotype" Oxford University 
Press, Oxford. UK. 1982). If so. QTL represent the effects of alleles that perturb the complex pathways that culminate 
in quantitative phenotypes. In this scenark>. it would be easy to alter the quantitative phenotype (i.e. the presence of 
an undesirable allele at any point in the pathway would be sufficient to derail the pathway) but difficult to achieve the 
quantitative phenotype (i.e. a host of desired alleles at multiple loci would be required to achieve the quantitative 

35 phenotype). If this were the case, then introgression of QTL alleles into unrelated germplasm would be quite a hit-or- 
miss proposition: germplasm unrelated to the reference mapping population could well have undesirable alleles at loci 
where favorable alleles were fixed in the reference mapping population. 

[0263] The complexity of control will likely vary with the character under study. Ultimately, even the most complex 
phenotype should finally be reducible to a series of discrete, if highly interactive, steps. Relatively few QTL are detected 

40 for complex phenotypes, such as yield, components of malting quality, and quantitative resistance to biotic and abiotic 
stresses (reviewed by Hayes et al., "Barley genome mapping and its applications" in P.P. Jauhar (ed.) Methods of 
Genome Analysis in Plants: ORG Press. Boca Raton, USA. 1996). In some cases, candidate genes can be put forth 
as QTL determinants: shattering resistance as a detemninant of yield (Hayes et al., 1993, above), and hydrolytic en- 
zymes as determinants of malt extract (the amount of soluble carbohydrate available as a substrate for fenmentation) 

45 (see Hayes, P.M. 1996m above). Barley genome mapping: new insights into the malting quality of the worid's oldest 
crop, (see, e.g., MBAA 33:223-225). 

[0264] In other cases, no candidate genes can as yet be put forth as determinants of complex phenotypes. An ex- 
ample is quantitative adult plant resistance to stripe rust, where resistance QTL do not coincide with the reported map 
locations of resistance genes showing patterns of Mendelian inheritance (see Hayes. P.M., et al. "Multiple disease 
50 resistance loci and their relationship to agronomic and quality loci in a spring barley population", JQTL, 1 996 and http: 
//probe.nalusda.gov:8000/otherdocs/jqtl/index.htm). 

[0265] DNA-level polymorphisms can also be used to explore issues of genetic diversity. Genetic diversity has been 
measured in a number of barley gemiplasm arrays, using tools ranging from pedigree analysis (see, e.g., Eslick, R.F. 
et al.. "Genetic engineering as a key to water use efficiency" Agric. Meteor, t4: 13-23, 1974) to morphological traits 
55 (see, e.g., Tolbert, D.M, et al. "A diversity analysis of a worid collection of bariey" Crop Sci. 79:789-794, 1979), to 
molecular mart<ers of various sorts (Melchinger, A.E. et al. "Relationships among European bariey germplasm: 1. Ge- 
netic diversity among winter and spring cultivars revealed by RFLPs" Crop Sci. 34:1191-1199, 1994; Saghai-Maroof, 
M.A. et al. "RFLPs in cultivated bariey and their application In the evaluation of malting quality cultivars" Hereditas 1 21 : 
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21-29, 1994). Molecular markers are particularly attractive from the standpoint of providing abundant, adaptively neu- 
tral, reference points. In the case of molecular polymorphisms, the time and resources required to generate data have 
been a limitation. 

[0266] Linkage map construction is the first step in a systematic QTL analysis. Tools such as the CMST technology 
5 platform disclosed herein as used in mapping should also be useful in enabling breeders to expand genetic diversity 
- - - - . _ . without redudng the value of the working germplasm pooL To date, there has not been sufficient information available 
to integrate a characterization of diversity with the results of QTL analyses. The key to such ah integrative strategy is' 
a marker technology that is quick, cost-effective, and provides abundant polymorphism throughout the genome. The 
present invention provides this key. 

10 

c. Enzymatic detection of mutation and the applications of tags. 

[0267] In this particular application or method, mismatches in heteroduplexes are detected by enzymatic cleavage 
of mismatched base pairs in a given nucleic acid duplex. DNA sequences to be tested for the presence of a mutation 
are amplified by PCR using a specific set of primers, the amplified products are denatured and mixed with denatured 
reference fragments and hybridized which result in the formation of heteroduplexes. The heteroduplexes are then 
treated with enzymes which recognize and cleave the duplex if a mismatch is present. Such enzymes are nuclease 
SI . Mung bean nuclease, "resolvases", T4 endonuclease IV, etc. Essentially any enzyme can be used which recognizes 
mismatches in vitro and cleave the resulting mismatch. The treatment with the appropriate enzyme, the DNA duplexes 
are separated by size, by, for example HPLC or PAGE. The DNA fragments are collected temporally. Tags are cleaved 
and detected. The presence of a mutation is detected by the shift in mobility of a fragments relative to a wild-type 
reference fragmenL 

d. Applications of tags to the oligonucleotide ligation assay (OLA). 

25 

[0268] The oligonucleotide ligation assay as originally described by Landegren et al. (Landegen et al., Science 241: 
487, 1988) is a useful technique for the identification of sequences (known) in very targe and complex genomes. The 
principle of the OLA reaction Is based on the ability of ligase to covalently join two diagnostic oligonucleotides as they 
hybridize adjacent to one another on a given DNA target. If the sequences at the probe junctions are not perfectly 
based-paired, the probes will not be joined by the ligase. The ability of a thermostable ligase to discriminate potential 
single base-pair differences when positioned at the 3* end of the "upstream" probe provides the opportunity for single 
base-pair resolution (Barony, PNAS USA 68:189. 1991 ). In the application of tags, the tags can be attached to a probe 
which is ligated to the amplified producL After completion of the OLR, the fragments are separated on the basis of 
size, the tags cleaved and detected by mass spectrometry. 

e. Sequence specific amplification. 

[0269] PCR primers with a 3* end complementary either to a mutant or normal oligonucleotide sequence can be used 
to selectively amplify one or the other allele (Newton et al., Nuc. Acids Res,, 17, p2503; et al., 1989, Genomics, 5, 
40 p535; Okayama et al„ 1989. J. Lab. Clin. Med., 114, p105; Sommer et al., 1989, Mayo Glin.Proc. 64, 1361; Wu et al.. 
PNAS USA, 86, p2757). Usually the PCR products are visualized after amplification by PAGE, but the principle of 
sequence specific amplification can be applied to solid phase formats. 

f. Application of tags to some amplification based assays. 

45 

[0270] Genotyping of viruses: One application of tags is the genotyping or identification of viruses by hybridization 
with tagged probes. For example. RNA coliphages may be useful candidates as indicators for enteric viais con- 
tamination. Genotyping by nucleic acid hybridization methods is a reliable, rapid, simple, and inexpensive altemative 
to serotyping (Kafatos et, al., Nucleic Acids Res. 7:1541, 1979). Amplification techniques and nucleic aid hybridization 
50 techniques have been successfully used to classify a variety of microorganisms including E. coli (Feng, Mot. Cell Probes 
7:151 , 1993). Representative examples of viruses that may be detected utilizing the present invention include rotavirus 
(Sethabutr eL al., J. Med Virol. 37:192, 1992), hepatitis viruses such as hepatitis C virus (Stuyver eL al„ J. Gen Virol. 
74:1093. 1993), herpes simplex vims (Matsumoto et al., 1 Virol Methods 40:119. 1992). 

(0271 ] Prognostic applications of mutational analysis in cancers: Genetic alterations have been described in a variety 
55 of experimental mammalian and human neoplasms and represent the morphological basis for the sequence of mor- 
phological alterations observed in carcinogenesis (Vogelstein et al.. NEJM 319:525, 1988). In recent years with the 
advent of molecular biology techniques, allelic losses on certain chromosomes or mutation of tumor suppressor genes 
as well as mutations in several oncogenes (e.g., c-myc. c-jun. and the ras family) have been the most studied entities. 
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Previous work (Finkelstein et al.. Arch Surg. 128:526, 1993) has identifled a correlation between specific types of point 
mutations In the K-ras oncogene and the stage at diagnosis in colorectal carcinoma. The results suggested that mu- 
tational analysis could provide important information of tumor aggressiveness, including the pattern and spread of 
metastasis. The prognostic value of TP53 and K-ras-2 mutational analysis in stage III carconoma of the colon has 
5 more recently been demonstrated (Pricolo et al., Am. J. Surg. 171 A'i, 1996). It is therefore apparent that genotyping 
of tumors and pre-cancerous cells, and specific mutation detection will become increasingly important in the treatment 
of cancers in humans! ~ -- --- _ _ . ___ 

9. Single nucleotide extension assay 

10 

[0272] The primer extension technique for the detection of single nucleotide in genomic DNA was first described by 
Sokolov in 1989 {Nudeic Acids Res. 18(12): 3671. 1989), In this paper. Sokolov described the single nucleotide ex- 
tension of 30-mers and 20-mers complementary to the known sequence of the cystic fibrosis gene. It was shown that 
the method had the ability to correctly identify a single nucleotide change within t the gene. The method was based 

15 on the use of radiolabelled deoxynucleotides for a labeling method in the single nucleotide extension assay 

[0273] Later publications described the use of single nucleotide extension assays for genetic diseases such as he> 
mophilia B (factor IX) and the cyctic fibrosis gene (see. e.g., Kuppuswamy et al.. PNAS USA 88:p1 143-1 147, 1991). 
Kuppuswamy et al. showed that the single nucleotide extension assay could be used to detect genetic diseases, the 
application being to the detection of hemophilia B (factor iX) and the cyctic fibrosis gene. Again, this method is based 

20 on the single nucletide primer that is hybridized to a sequence that is adjacent to a known single nucleotide polymor- 
phism. The primed genomic DNA is then subjected to conditions in which Taq polymerase will add a P32 labelled dNTP 
if the site acrosss from the site of interest is complementary to the to the alpha lat)elled in the reaction mixture. 
[0274] Recently, the parameters of the single nucleotide extension assay in terms of the quantitative range, variability, 
and multiplex analysis has been described in detail. Thus, in 1996, Greenwood and Burke (Greenwood, A.D. and 

25 Burke, D.T (1 996) Genome Research, 6, p336-348) described in deail the parameters of the single nucleotide extension 
assy in terms of the quantitative range, variability, and multiplex analysis. RNA served as a template for the PGR 
amplification of a sequence of interest containing a single-base difference between two alleles. Each PCR-generated 
template is analyzed for the presence, absence, or relative amounts of each allele by annealing a primer that is I base 
5* to the polymorphism and extending by 1 labelled base (or using an labelled base). Only when the correct base Is 

30 available in the reaction will incorporation occur at the 3'-end of the primer. Extension products are then analyzed 
(traditionally by PAGE). Thus, this strategy is based on the fidelity of the DNA polymerase to add only the correctly 
paired nucleotide onto the 3' end of the template hybridized primer. Since only one dideoxy-terminator nucleotide is 
added per reaction, it is a simple matter to sort out which primer has been extended in all four types of dNTPs. 
[0275] Hence, within one aspect of the invention methods are provided for the detection of a single selected nucleic 

35 acid within a nucleic add molecule, comprising the steps of (a) hybridizing in at least two separate reactions a tagged 
primer and a target nucleic acid molecule under conditions and for a time sufficient to pemnit hybridization of the primer 
to the target nucleic acid molecule, wherein each reaction contains an enzyme which will add a nucleotide chain ter- 
minator, and. a nucleotide chain terminator complementary to adenosine, cytosine, guanosine, thymidine or uracil, and 
wherein each reaction contains a different nucleotide chain terminator, (b) separating tagged primers by size, (c) deav- 

40 ing the tag from the tagged primer, and (d) detecting the tag by non-fluorescent specA-ometry or potentiometry, and 
therefrom determining the presence of the selected nucleotide within the nudeic acid molecule. 
[0276] As noted herein a wide variety of separation methods may be utilized, induding for example liquid chroma- 
tographic means such as HPLG. In addition, a wide variety of detection methodologies may be utilized, including for 
example, mass spectrometry, infrared spectrometry, ultraviolet spectrometry, or, potentiostatic amperometry. Also, sev- 

45 eral different enzymes may be utilized (e.g.. a polymerase), as well as any of the tags provided herein. Within certain 
preferred embodiments, each primer which is utilized within a reaction has a different unique tag. In this manner, 
multiple samples (or multiple sites) may be simultaneously probed for the presence of selected nucleotides, 
[0277] Single nucleotide assays such as those described herein may be utilized to detect polymorphic variants, or 
to interrogate a biological sample for the presence a specific nudeotide within or near a known sequence. Target 

50 nudeic add molecules indude not only DNA (e.g.. genomic DNA), but RNA as well. 

[0278] In general, this method involves hybridizing a primer to the target DNA sequence such that the 3' end of the 
primer is immediately adjacent to the mutation to be detected and identified. The procedure is similar to the Sanger 
sequencing reaction except that only the dideoxynudeotide of a given nucleotide is added to the reaction mixture. 
Each dideoxynudeotide is labeled with a unique tag. Of the four reaction mixtures, solely one will add a dideoxytermi- 

55 nator on to the primer sequence. If the mutation is present, it will be detected through the unique tag on the dideoxy- 
nudeotide and its identity established . Multiple mutations can be ascertained simultaneously by tagging the DNA primer 
with a unique tag as well. Within one aspect of the invention methods are provided for analyzing single nudeotide 
mutations from a selected biological sample, comprising the steps of exposing nudeic acids from a biological sample 
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and combining the exposed nucleic acids with one or more selected nucleic acid probes, which may or may not be 
tagged, under conditions and for a time sufficient for said probes to hybridize to said nucleic acids, wherein the tag, if 
used, is correlative with a particular nucleic acid probe and detectable by non-fluorescent spectrometry, or potentiom- 
etry. The DNA fragments are reacted in four separate reactions each including a different tagged dideoxyterminator, 

5 wherein the tag is con-elative with a particular dideoxynudeotide and detectable by non-fluorescent spectrometry, or 
potentiometry. The DNA fragments are separated according to size by, for example, gel electrophoresis (e.g.. polyacr- 
ylamide gel electrophoresis) or preferably HPLC. The tags are cleaved from the separated fragments and detected by 
the respective detection technology (e.g., mass spectrometry, infrared spectrometry, potentiostatic amperometry or 
UV/visible spectrophotometry). The tags detected can be con-elated to the particular DNA fragment under investigation 

10 as well as the identity of the mutant nucleotide. 

10. Amplified Fragment Length Polymorphism (AFLP) 

[0279] AFLP was designed as a highly sensitive method for DNA fingerprinting to be used in a variety of fields, 

15 including plant and animal breeding, medical diagnostics, forensic analysis and microbial typing, to name a few. (Vos 
et al, Nucleic Acids Res. 23:4407-4414, 1995.) The power of AFLP is based upon the molecular genetic variations 
that exist between closely related species, varieties or cultivars. These variations in DNA sequence are exploited by 
the genetic fingerprinting technology such that "fingerprints" of particular genotypes can be routinely generated. These 
"fingerprints" are simply RFLPs visualized by selective PGR amplification of DNA restriction fragments. Briefly genetic 

20 fingerprinting technology consists of the following steps: genomic DNA is digested to completion by two different re- 
striction enzymes. Specific double strand oligonucleotide adapters (~ 25-30 bp) are ligated to the restricted DNA 
fragments. Oligonucleotide primers homologous to the adapters, but having extensions at the 3'-end are used to amplify 
a sutsset of the DNA fragments. (A pre-amplification step can also be performed where the extension is only 1 bp in 
length. Amplification with the primer having a 3 base-pair extension would follow.) These extensions can vary in length 

25 from 1 to 3 base-paV, but are of defined length for a given primer The sequence of the extension can also vary from 
one primer to another Init is of a single, defined sequence within a given primer. The selective nature of AFLP-PCR is 
based on the 3' extensions on the oligonucleotide primers. Since these extensions are not homologous to adapter 
sequence, only DNA fragments complementary to the extensions will be ampflfied due to the inability of Taq DNA 
polymerase, unlike some other DNA polymerases, to extend DNAs if mismatches occur at the 3'-end of a molecule 

30 that is being synthesized. Therefore only a subset of the entire genome is amplified in any reaction. For example, if 2 
base-pair (bp) extensions are used, only one in 256 molecules is amplified. To further limit the number of fragments 
that are actually visualized (so that a manageable number is observed), only one of the primers is labeled'. Finally, the 
amplified DNAs are separated on a polyacrylamide gel (sequencing type) and an autonadiograph or phosphor Image 
is generated. 

35 [0280] Within one embodiment of the invention methods are provided for detennining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, ufilizing the technique 
of genetic fingerprinting. Briefly, such methods generally comprise the steps of digesting (e.g., genomic DNA) to com- 
pletion by two different restriction enzymes. Specific double-strand oligonucleotide adapters (-- 25-30 bp) are ligated 
to the restricted DNA fragments. Opfional pre-amplification utilizing primers with a 1 bp extension may be performed. 

40 The PGR product is then diluted, and tagged primers homologous to the adapters, but having extensions at the 3'-end 
are used to amplify by PGR a subset of the DNA fragments. The resulting PGR products are then separated by size. 
The size separation step can be accomplished by a variety of methods, including for example. HPLC. The tags are 
then cleaved from the separated fragments, and detected by the respective detection technology (e.g.. mass spec- 
trometry, Infrared spectrometry, potentiostatic amperometry or UV/visible spectrophotometry). 

45 

11. Gene Expression Analysis 

[0281] One of the inventions disclosed herein is a high throughput method for measuring the expression of numerous 
genes (1-2000) in a single measurement The method also has the ability to be done in parallel with greater than one 

50 hundred samples per process. The method Is applicable to drug saeening. developmental biology, molecular medicine 
studies and the like. Within one aspect of the invention methods are provided for analyzing the pattern of gene expres- 
sion from a selected biological sample, comprising the steps of (a) exposing nucleic acids from a biological sample, 
(b) combining the exposed nucleic acids with one or more selected tagged nucleic acid probes, under conditions and 
for a time sufficient for the probes to hybridize to the nucleic acids, wherein the tag is correlative with a particular nucleic 

55 acid probe and detectable by non-fluorescent spectrometry, or potentiometry, (c) separating hybridized probes from 
unhybridized probes, (d) cleaving the tag from the tagged fragment, and (e) detecting the tag by non-fluorescent spec- 
trometry, or potentiometry. and therefrom determining the pattem of gene expression of the biological sample. 
[0282] Within a particulariy preferred embodiment of the invention, assays or methods are provided which are de- 
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scribed as follows: RNA from a target source is bound to a solid support through a specific hybridization step {i.e., 
capture of poIy(A) mRNA by a tethered oligo(dT) capture probe). The solid support is then washed and cDNA is syn- 
thesized on the solid support using standard methods {i.e., reverse transcriptase). The RNA strand Is then removed 
via hydrolysis. The result is the generation of a DNA population which is covalently immobilized to the solid support 

5 which reflects the diversity, abundance, and complexity of the RNA from which the cDNA was synthesized. The solid 
support then interrogated (hybridized) with 1 to several thousand probes that are complementary to a gene sequence 
of interest. Each probe type is labeled with a cleavable mass spectrometry tag or other type of deavable tag. After the ~ 
interrogation step, excess or unhybridized probe is washed away, the solid support is placed, for example, in the well 
of a microtiter plate and the mass spectrometry tag is cleaved from the solid support. The solid support is removed 

10 from the well of sample container, and the contents of the well are measured with a mass spectrometer. The appearance 
of specific mass spectrometer tags indicates the presence of RNA in the sample and evidence that a specific gene is 
expressed in a given biological sample. The method can also be quantifiable. 

[0283] The compositions and methods for the rapid measurement of gene expression using cleavable tags can be 
described in detail as follows. Briefly, tissue (liver, muscle, etc.), primary or transformed cell lines, isolated or purified 

15 cell types or any other source of biological material in which determining genetic expression is useful can be used as 
a source of RNA. In the prefen-ed method, the biological source material is lysed in the presence of a chaotrope in 
orderto suppress nucleases and proteases and support stringent hybridization of target nucleic acid to the solid support. 
Tissues, cells and biological sources can be effectively lysed in 1 to 6 molar chaotropic salts (guanidine hydrochloride, 
guanidine thiocyanate, sodium perchlorate, etc.). After the source biological sample is lysed, the solution is mixed with 

20 a solid support to effect capture of target nucleic acid present In the lysate. In one permutation of the method, RNA is 
captured using a tethered oligo(dT) capture probe. Solid supports can include nylon beads, polystyrene microbeads, 
glass beads and glass surfaces or any other type of solid support to which oligonucleotides can be covalently attached. 
The solid supports are preferentially coated with an amine-polymer such as polyethylene(imine). acrylamide, amine- 
dendrimers, etc. The amines on the polymers are used to covalently immobilize oligonucleotides. Oligonucleotides are 

25 preferentially synthesized with a 5'-amine (generally a hexylamine that includes a six carbon spacer-arm and a distal 
amine). Oligonucleotides can be 15 to 50 nucleotides in length. Oligonucleotides are activated with homo-bifunctional 
or hetero-bifunctional cross-linking reagents such as cyanuric chloride. The activated oligonucleotides are purified from 
excess cross-linking reagent (i.e., cyanuric chloride) by exclusion chromatography. The activated oligonucleotide are 
then mixed with the solid supports to effect covalent attachment. After covalent attachment of the oligonucleotides, the 

30 unreacted amines of the solid support are capped (i.e., with succinic anhydride) to eliminate the positive charge of the 
solid support. The solid supports can be used in parallel and are preferentially configured in a 96-well or 384-well 
format. The solid supports can be attached to pegs, stems, or rods in a 96-weir or 384-well configuration, the solid 
supports either being detachable or alternatively integral to the particular configuration. The particular configuration of 
the solkl supports Is not of critical importance to the functioning of the assay, but rather, affects the ability of the assay 

35 to be adapted to automation. The solid supports are mixed with the lysate for 15 minutes to several hours to effect 
capture of the target nucleic acid onto the solid support. In general, the "capture" of the target nucleic acid is through 
complementary base pairing of target RNA and the capture probe immobilized on the solid support. One pennutation 
utilizes the 3' poly(A) stretch found on most eucaryotic messengers RNAs to hybridize to a tethered oligo(dT) on the 
solid support Another pennutation is to utilize a specific oligonucleotide or long probes (greater than 50 bases) to 

40 capture an RNA containing a defined sequence. Another possibility Is to employ degenerate primers (oligonucleotides) 
that would effect the capture of numerous related sequences in the target RNA population. The sequence complexity 
of the RNA population and the type of capture probe employed guide hybridization times. Hybridization temperatures 
are dictated by the type of chaotrope employed and the final concentration of chaotrope (see Van Ness and Chen, 
Nuc. Acids Res. 1 991 . for general guidelines). The lysate is preferentially agitated continually with the solid support to 

45 effect diffusion of the target RNA. Once the step of capturing the target nucleic acid is accomplished, the lysate is 
washed from the solid support and all chaotrope or hybridization solution is removed. The solid support is preferentially 
washed with solutions containing ionic or non-ionic detergents, buffers and salts. The next step is the synthesis of DNA 
complementary to the captured RNA. In this step, the tethered capture oligonucleotide serves as the extension primer 
for reverse transcriptase. The reaction Is generally performed at 25 to 37°C and preferably agitated during the polym- 

50 erization reaction. After the cDNA is synthesized, it becomes covalently attached to the solkl support since the capture 
oligonucleotide serves as the extension primer. The RNA is then hydrolyzed from the cDNA/RNA duplex. The step can 
be effected by the use of heat that denatures the duplex or the use of base {i.e., 0.1 N NaOH) to chemically hydrolyze 
the RNA. The key result at this step is to make, the cDNA available for subsequent hybridization with defined probes. 
The solid support or set of solid supports is then further washed to remove RNA or RNA fi^gments. At this point, the 

55 solid support contains a approximate representative population of cDNA molecules that represents the RNA population 
in terms of sequence abundance, complexity, and diversity. The next step is to hybridize selected probes to the solid 
support to identify the presence or absence and the relative abundance of specific cONA sequences. Probes are 
preferentially oligonucleotides in length of 15 to 50 nucleotides. The sequence of the probes Is dictated by the end- 
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user of the assay. For example, if the end-user intended to study gene expression in an inflammatory response in a 
tissue, probes would be selected to be complementary to numerous cytokine mRNAs, RNAs that encode enzymes 
that modulate lipids, RNAs that encode factors that regulate cells involved in an inflammatory response, etc. Once a 
set of defined sequences are deflned for study, each sequence is made into an oligonucleotide probe and each probe 

5 is assigned a specific deavable tag. The tag(s) is then attached to the respective oiigonucleotide(s). The oligonucleotide 
(s) are hybridized to the cDNA on the solid support^under appropriate hybridization conditions. After completion of the 
hybridization step, the solid support is washed to remove any unhybridized probe. The solid support or array of supports" 
is then heated to cleave the covalent bond between the cDNA and the solid support. The tagged cDNA fragments are 
then separated according to size by gel electrophoresis (e.g.. polyacrylamide gel electrophoresis) or preferably HPLC. 

10 The tags are then cleaved from the DMA probe molecules, and detected by the respective detection technology (e.g.. 
mass spectrometry, infrared spectrometry, potentiostatic amperometry or UV/visible spectrophotometry). Each tag 
present is identified, and the presence (and abundance) or absence of an expressed mRNA is determined. 
[0284] An altemative procedure would hybridize the tagged DNA probes directly to the tethered mRNA target mol- 
ecules under the appropriate hybridization conditions. After completion of the hybridization step, the solid support is 

15 washed to remove any unhybridized probe. The RNA is then hydrolyzed from the DNA probe/RNA duplex. The step 
can be effected by the use of heat which denatures the duplex or the use of base (/.e.. 0.1 N NaOH) to chemically 
hydrolyze the RNA. This step will leave free mRNA and their con^esponding DNA probes that can then be isolated 
through a size separation step generally consisting of gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or 
preferably HPLC. The tags are then cleaved from the DNA probe molecules, and detected by the respective detection 

20 technology (e.g.. mass spectrometry, infrared spectrometry, potentiostatic amperometry or UV/vislble spectrophotom- 
etry). Each tag present is Identified, and the presence (and abundance) or absence of an expressed mRNA is deter- 
mined. 

[0285] A preferred gene expression assay of the present invention utilizes tagged oligonucleotides in conjunction 
with PGR (or other equally effective technique), lambda exonuclease, ultrafiltration, and an internal standard to afford 
25 quantitative information about gene expression. This preferred gene expression assay, and the submethods thereof, 
are describd next. 

a DESCRIPTION OF INTERNAL STANDARD METHOD 

30 [0286] PGR is a highly sensitive method for the detection of small amounts of DNA or RNA (by RT-PCR). However, 
accurate and precise quantitation of the target is difficult because the amount of amplified product is not always pro- 
portional to the amount of template. This is because PGR reaches a "plateau phase" in which almost the same amount 
of amplified product will be obtained, regardless of the amount of template, after a certain number of cycles. Using an 
intemal standard during PGR helps overcome this problem by co-amplifying known amounts of an internal standard 

35 template, using the same primer set as the target template. Since both templates use the same set of primers, the 
ratio between the amounts of the two amplified products reflects the initial ratio between the amount of target and 
intemal standard template prior to PGR amplification. The amount of target template can then be calculated from the 
known amount of the internal standard template. 

[0287] The prefen-ed intemal standard template for a PGR assay will amplify with identical efficiency to the target 

^0 template. Identical primer sites are built into intemal standard template to assure co-amplification. The region between 
the primer sites in the internal standard template are typically altered from the target template (e.g. deletion or addition 
of 10 to 20 base pairs) to make the templates distinguishable by gel electrophoresis or by restriction enzymes. Modi- 
fications of this type however cause differences in the amplification efficiency of the templates. Typically, a number of 
intemal standard templates are built and tested until nearly identical amplification is found. 

45 [0288] An intemal standard assay format has been developed using the GMST tags and hybotropes. Hybotropes 
are described in, e.g., U. S. Application Nos. 60/026,621 (filed September24, 1996); 08/719,132 (filed September24, 
1996): 08/933,924 (filed September 23, 1997); 09/002,051 (filed December 31. 1997); and International Publication 
No. WO 98/13527, all of which are incorporated herein in their entireties. Basically, hybotrope refers to any chemical 
that can increase the enthalpy of a nucleic acid duplex by 20% or more when referenced to a standard salt solution (/'. 

50 e.. 0.165 M NaGi). A chemical exhibits hybotropic properties when, as a solution an 18 bp oligonucleotide duplex that 
is 50% G+G has a helical-coil transition (HGT) of 15''G or less. HGT is the difference between the temperatures at 
which 80% and 20% of the duplex is single-stranded. The temperature for annealing is then chosen to be the discrim- 
ination temperature, which is a temperature at which a hybridization reaction is performed that allows detectable dis- 
crimination between a mismatched duplex and a perfectly matched duplex. A range of temperatures satisfy criteria of 

55 a discrimination temperature. 

[0289] Because highly specific hybridization can be perfonned using hybotropes, a preferred intemal standard tem- 
plate can be used that is the same length and only 1 base pair different from the target template. This is a prefen-ed 
intemal standard since a priori it will co-amplify identically with the target template. A preferred method of detection of 
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this type of standard and target amplicon mixture is to use hylx)trope buffers to get specific hybridization of amplicon- 
specific tagged-oligonudeotides (preferably being CMST tagged, i.e., having tags detectable by mass spectrometry) 
to their respective amplicons. To detect the amount of amplicon, each amplicon-specific oligonucleotide is tagged with 
a unique mass tag. Amplicon quantititation is derived by measuring the mass spec signal which is associated with the 
5 internal standard and the target amplicons. A known amount of internal standard is spiked into a sample and the amount 
of target amplicon is determined using the following equation, and using the data obtained following the lambda exo- 
nudease and ultrafiittBtions methods described next: 

10 ("^^''9®* mass signal ' *"temal Standard ^^^) X Internal Standard = Target 

b. DESCRIPTION OF LAMBDA EXONUCLEASE METHOD 

[0290] Upon incubation with a DNA duplex, lambda exonuclease selectively digests one strand from a 5* phospho- 
rs rylated end, leaving a single-stranded template suitable for DNA sequencing. Lambda exonuclease prepares single- 
stranded sequencing templates without the effort of traditional biological methods or the tedium of optimizing asym- 
metric PGR. By amplifying DNA in the presence of a primer that contains a 5'-terminal phosphate, one of skill In the 
art can generate a DNA duplex with a 5' phosphorylated end. Afterwards, the PGR product is purified either by precip- 
itation or gel filtration to remove residual primers and other reaction components. The phosphorylated strand of the 
20 DNA duplex is then selectively degraded by lambda exonudease, leaving behind a single-stranded, nonphosphorylated 
template suitable for sequencing. After heat inactivation of the lambda exonudease. the concentrated, single-stranded 
DNA can be added directly to a hybridization reaction employing conventional techniques. 

c. DESCRIPTION OF ULTRAFILTRATION METHOD 

25 

[0291] Genomic DNA as collected from white blood cells may be purified according to standard methods of Rnase 
treatment, proteinase K digestbn, and phenolichloroform extraction followed by precipitation with sodium acetate and 
ethanol. Goncentrations are determined by spectrophotometry and working dilutions of 0.0^\lg/\^\ are prepared. 50 ^ils 
of the DNA samples are laid out into 96 well "mother" plates. "Daughter" plates used for the amplification reaction are 
30 prepared by transfening 1 .5 nl of DNA from the mother plate into the wells of either a 96 well piate or a 192 well plate 
in an identical configuration using 8 channel pipettors. An 18 fii layer of liquid wax (MJ Research) is then 9dded and 
the plate is stored at 4 degrees, where the wax solidifies, preventing evaporatiori. 

[0292] To set up the PGR. the daughter plates are removed and placed on ice to keep the wax solid. This forms a 
barrier between the template DNA and other components of the reaction until the plate is placed in the thermal-cycler 

35 and the reaction is Initiated by heating during the first cyde. The PGR is performed in a total of 10-50 ^iJ per reaction. 
Master mix solutions are prepared ahead of time in bulk and aliquoted into tubes containing ail the components of the 
PGR except the marker-specific primers. The majority of the reactions are perfomied with M13 tailed primers. Ml 3 
tailed primers are a modification of standard PGR primer pairs. The modification is the addition of 17 nudeotides to 
the 5' end of the forward primer. The 17 nudeotide sequence is complementary to the M 13 sequencing primer and 

40 possesses the sequence: 5'-(NH2-G6)-AGG GTT TTG GGA GTG AGG AG-3'. The modification penults the use of a 
third oligonudeotide primer in a PGR reaction. The third primer is typically tagged according to methods decribed herein. 
[0293] Ultrafiltration is the traditional method for concentrating and desalting proteins; it is also an efficient alternative 
to ethanol precipitation of nudeic adds, especially for small amounts, and especially whenever nudeic adds are pre- 
cipitated solely to change solvents. For samples containing phosphate or 10 mM EDTA, ultrafiltration can be a consid- 

45 erable time-saving methodology. Traditionally, such samples required preliminary dialysis to avoid coprecipitatlon of 
salts with nudeic acids during ethanol precipitation. Gentrlfugal microconcentrators desalt and concentrate oligonu- 
cleotide probes or singlle-stranded amplicons (and nucleic acids] in one simple step. 

[0294] Mlcrocon® Microconcentrators are ideal for concentrating 50-500 \i\ samples. In centrifugal ultrafiltration, DNA 
is retained by the membrane. Solvent and salts pass and are removed. A second, inverted spin of Mlcrocon assures 
50 maximal DNA recovery of the probe or nudeic acid of interest. In concentrating the oligomer samples, it is important 
to avoid high salt concentrations, which promote binding of single-stranded nucleic acids to cellulose-based ultrafiltra- 
tion membranes. 

[0295] Typical ultrafilti-ation conditions are as follows: a DNA solution (500 ^I) is spun in a Microcon-30 concentrator 
for 10 minutes at 12,000 x g; a 500 \»i solution of oligonucleotides in TE buffer is spun in a Microcon-3 unit for 45 
55 minutes at 12.000 x g. Retentates may be recovered by Inverting the devices and centrifuging at 500-1000 x g for 2 
minutes. In contrast, gel electrophoresis requires fractionation, elution and desalting of the fragment from the gel slice. 
It also requires enough material to visualize, which is sometimes difficult to obtain, especially in the case of cDNA. 
Both methods (gel electrophoresis and dialysis) are time-consuming and Involve many sample processing steps. 
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[0296] In a typical hybridization reaction, probes are added in 50- to 100-fold molar excess of DNA fragment con- 
centration. For this reason, it is necessary to remove the excess unhybridized probe. Conventional methods for probe 
removal Include gel filtration chromatography or gel electrophoresis. Gel filtration requires that multiple fractions be 
collected analyzed, pooled and precipitated which is not amenable to high throughput assays. However, ultrafiltration 
5 is an effective alternative for rapid removal of excess probe or PGR primers. In Amicon's Centricon disposable con- 
-centrator, the reaction mixture is filtered through an ultrafiltratipn membrane, resulting in the removal of buffer and non- 
hybridized probe or non-extended primers. The concentrated fragments are retained by the membrane. Driving force 
for filtration is provided by centrifugation in a fixed-angle rotor at 1 .000-5.000 x g. Conventional methods typically 
require 24 hours for sample processing and 2-3 hours of hands-on time. With Centricon. samples typically take less 
10 than a few hours to purify. Sample handling is minimal and many samples can be processed at the same time. For 
reference see Krowczynska, A..M., "Efficient Purification of PCR Products using Ultrafiltration" BioTechniques 13(2): 
286-289, 1992. 

12. Hybridization Techniques 

15 

[0297] The successful cloning and sequencing of a gene leads to the investigation of its staicture and expression 
by making it possible to detect the gene or its mRNA in a large pool of unrelated DNA or RNA molecules. The amount 
of mRNA encoding a specific protein in a tissue is an important parameter for the activity of a gene and may be sig- 
nificantly related to the activity of function systems. Its regulation is dependent upon the interaction between sequences 
20 within the gene (cis-acting elements) and sequence-specific DNA binding proteins (trans-acting factors), which are 
activated tissue-specifically or by hormones and second messenger systems. 

[0298] Several techniques are available for analysis of a particular gene, its regulatory sequences, its specific mRNA 
and the regulation of its expression; these include Southern or Northern blot analysis and ribonuclease (RNase) pro- 
tection assay. 

25 [0299] Variations in the nucleotide composition of a certain gene may be of great pathophysiological relevance. When 
localized in the non-coding regions (5*. 3'-flanking regions and intron). they can affect the regulation of gene expression, 
causing abnormal activation or inhibition. When localized in the coding regions of the gene (exons), they may result 
in alteration of the protein funcfion or dysfunctional proteins. Thus, a certain sequence within a gene can correlate to 
a specific disease and can be useful as a marker of the disease. One primary goal of research in the medical field is, 

30 therefore, to detect those genetic variations as diagnostic tools, and to gain important information for the understanding 
of pathophysiological phenomena. The basic method for the analysis of a population regarding the variations within a 
certain gene is DNA analysis using the Southern blot technique. Briefly, prepared genomic DNA is digested with a 
• restriction enzyme (RE), resulting in a large number of DNA fragments of different lengths, determined by the presence 
of the specific recognition site of the RE on the genome. Alleles of a certain gene with mutations inside this restriction 

35 site will be cleaved into fragments of different number and length. This is called restriction fragment length polymorphism 
(RFLP) and can be an important diagnostic marker with many applications. The fragment to be analyzed has to be 
separated from the pool of DNA fragments and distinguished from other DNA species using a specific probe. Thus, 
DNA is subjected to electrophoretic fractionation using an agarose gel, followed by transfer and fixation to a nylon or 
nitrocellulose membrane. The fixed, single-stranded DNA is hybridized to a tagged DNA that is complementary to the 

40 DNA to be detected. After removing non-specific hybridizations, the DNA fragment of interest can be visualized ac- 
cording to the probes characteristics (autoradiography or phosphor image analysis). 

[0300] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in. for example a biological sample, utilizing the techniques 
similar to Southern blotting. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic 
45 acid fragments in which the fragments generated are digested with restriction enzymes. The tagged fragments are 
generated by conducting a hybridization step of the tagged probes with the digested target nucleic acid. The hybridi- 
zation step can take place prior to or after the restriction nuclease digestion. The resulting digested nucleic acid frag- 
ments are then separated by size. The size separation step can be accomplished, for example, by gel electrophoresis 
(e.g., polyacrylamide gel electrophoresis) or preferably HPLC. The tags are then cleaved, from the separated frag- 
so ments, and then the tags are detected by the respective detection technology (e.g., mass spectrometry, infrared spec- 
trometry, potentiostatic amperometry or UV/visible spectrophotometry). 

[0301] The presence and quantification of a specific gene transcript and its regulation by physiological parameters 
can be analyzed by means of Northern blot analysis and RNase protection assay The principle basis of these methods 
is hybridization of a pool of total cellular RNA to a specific probe. In the Northern blot technique, total RNA of a tissue 
55 is fractionated using an HPLC or LC method, hybridized to a labeled antisense RNA (cRNA). complementary to the 
RNA to be detected. By applying stringent washing conditions, non-specifically bound molecules are eliminated. Spe- 
cifically bound molecules, would subsequently be detected according to the type of probe utilized (mass spectrometfy, 
or with a electrochemical detector). In addition, specificity can be controlled by comparing the size of the detected 
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mRNA with the predicted length of the mRNA of interest. 

[0302] Within one embodiment of the invention methods are provided for determining the identity of a ribonucleic 
acid molecule, or for detecting a selecting ribonucleic acid molecule, in, for example a biological sample, utilizing the 
techniques similar to Northern blotting. Briefly, such methods generally comprise the steps of generating a series of 
5 tagged RNA molecules by conducting a hybridization step of the tagged probes with the target RNA. The tagged RNA 
_ molecules are_then_separated_by_slze. The size separation step can be accomplished , for example by preferably HPLC. 
The tags are cleaved from the separated RNA molecules, and then the tags are detected by the respective detection 
technology (e.g., mass spectrometry, infrared spectrometry, potentiostatic amperometry or UV/visible spectrophotom- 
etry). 

10 [0303] The most specific method for detection of a mRNA species is the RNase protection assay. Briefly, total RNA 
from a tissue or cell culture is hybridized to a tagged spedfic cRNA of complete homology. Spedficity is accomplished 
by subsequent RNase digestion. Non-hybridized, single-stranded RNA and non-specifically hybridized fragments with 
even small mismatches will be recognized and cleaved, while double-stranded RNA of complete homology is not 
accessible to the enzyme and will be protected. After removing RNase by proteinase K digestion and phend extraction, 

15 the specific protected fragment can be separated from degradation products, usually by HPLC. 

[0304] Within one embodiment of the invention methods are provided for detenmining the identity of a ribonucleic 
acid molecule, or for detecting a selecting ribonudeic acid molecule, in, for example a biological sample, utilizing the 
technique of RNase protection assay. Briefly, such methods generally comprise the steps of total RNA from a tissue 
or cell culture being hybridized to a tagged specific cRNA of complete homology, a RNase digestion, treatment with 

20 proteinase K and a phenol extraction. The tagged, protected RNA fragment is isolated from the degradation products. 
The size separation step can be accomplished, for example by LC or HPLC. The tag is deaved from the separated 
RNA molecules, and then is detected by the respedive detection technology (e.g., mass spectrometry. Infrared spec- 
trometry, potentiostatic amperometry or UV/visible spectrophotometry). 

25 13. Mutation Detection Techniques 

[0305] The detection of diseases is increasingly important in prevention and treatments. While multifactorial diseases 

are difficult to devise genetic tests for, more than 200 known human disorders are caused by a defect in a single gene. 

often a change of a single amino acid residue (01 sen, Biotechnology: An industry comes of age. National Academic 
30 Press, 1986). Many of these mutations result in an altered amino acid that causes a disease state. 

[0306] Sensitive mutation detection techniques offer extraordinary possibilities for mutation saeening. For example. 

analyses may be performed even before the implantation of a fertilized egg (Holding and Monk, Lancet 3:532, 1989). 

Increasingly efficient genetic tests may also enable screening for oncogenic mutations in cells exfoliated firom the 

respiratory tract or the bladder in connection with health checkups (Sidransky et al., Science 252:706, 1991). Also, 
35 when an unknown gene causes a genetic disease, methods to monitor DNA sequence variants are useful to study the 

inheritance of disease through genetic linkage analysis. However, detecting and diagnosing mutations in individual 

genes poses technological and economic challenges. Several different approaches have been pursued, but none are 

both, efndent and inexpensive enough for truly wide-scale application. 

[0307] Mutations involving a single nucleotide can be identified in a sample by physical, chemical, or enzymatic 

^0 means. Generally, methods for mutation detection may be divided into scanning techniques, which are suitable to 
identify previously unknown mutations, and techniques designed to detect, distinguish, or quantitate known sequence 
variants. Several scanning techniques for mutation detection have been developed in heteroduplexes of mismatched 
complementary DNA strands, derived from wild type and mutant sequences, exhibit an abnormal behavior espedally 
when denatured. This phenomenon is exploited in denaturing and temperature gradient gel electrophoresis (DGGE 

45 and TGGE, respectively) methods. Duplexes mismatched in even a single nudeotide position can partially denature, 
resulting in retarded migration, when electrophoresed in an increasingly denaturing gradient gel (Myers et al.. Nature 
313:495. 1985; Abrades et al. Genomics 7:463, 1990; Henco et al.. Nucl. Acids Res. 18:6733, 1990). Although mu- 
tations may be deteded, no information is obtained regarding the precise location of a mutation. Mutant forms must 
be further isolated and subjected to DNA sequence analysis. Alternatively, RNase A may deave a heteroduplex of an 

50 RNA probe and a target strand at a position where the two strands are not property paired. The site of cleavage can 
then be determined by electrophoresis of the denatured probe. However, some mutations may escape detection be- 
cause not all mismatches are efficiently deaved by RNase A. Mismatched bases in a duplex are also susceptible to 
chemical modification. Such modifications can render the strands susceptible to cleavage at the site of the mismatch 
or cause a polymerase to stop in a subsequent extension reaction. The chemical deavage tedinique allows identifi- 

55 cation of a mutation in target sequences of up to 2 kb and it provides information on the approximate location of 
mismatched nudeotide(s) (Cotton et al., PNAS USA 85:4397, 1988; Ganguly et al.. Nucl. Acids Res. f 8:3933. 1991). 
However, this technique is labor intensive and may not identify the precise location of the mutation. 
[0308] An alternative strategy for detecting a mutatton in a DNA strand is by substituting (during synthesis) one of 



54 



EP 0 990 047 B1 



the normal nucleotides with a modified nucleotide, altering the molecular weight or other physical parameter of the 
product. A strand with an increased or decreased number of this modified nucleotide relative to the wild-type sequence 
exhibits altered electrophoretic mobility (Naylor et al., Lancet 337:635, 1991). This technique detects the presence of 
a mutation, but does not provide the location. 

5 [0309] Two other strategies visualize mutations in a ONA segment by altered gel migration, tn the single-strand 
conformation polymojphism tedinjque (SSCP), mutations cause denatured strands to adopt different secondary struc- 
tures, thereby influencing mobility during native gel electrophoresis. Heteroduplex DNA molecules, containing intemal 
mismatches, can also be separated from correctly matched molecules by electrophoresis (Orita, Genomics 5:874, 
1989; Keen, Trends Genet 7:5, 1991). As with the techniques discussed above, the presence of a mutation may be 

10 determined but not the location. As well, many of these techniques do not distinguish between a single and multiple 
mutations. All of the above-mentioned techniques indicate the presence of a mutation in a limited segment of DNA and 
some of them allow approximate localization within the segment. However, sequence analysis is still required to unravel 
the effect of the mutation on the coding potential of the segment. Sequence analysis is very powerful, allowing, for 
example, screening for the same mutation in other individuals of an affected family, monitoring disease progression in 

15 the case of malignant disease or for detecting residual malignant cells in the bone marrow before autologous trans- 
plantation. Despite these advantages, the procedure is unlikely to be adopted as a routine diagnostic method because 
of the high expense involved. 

[0310] A large number of other techniques have been developed to analyze known sequence variants. Automation 
and economy are very important considerations for these types of analyses that may be applied, for saeening individ- 
20 uals and the general population. None of the techniques discussed below combine economy and automation with the 
required specificity. 

[031 1] Mutations may be identified via their destabilizing effects on the hybridization of short oligonucleotide probes 
to a target sequence (see Wetmur. Crit Rev, Biochem. MoL BioL 26:227, 1 991 ). Generally, this technique, allele-specific 
oligonucleotide hybridization, involves amplification of target sequences and subsequent hybridization with short oli- 
25 gonucleotide probes. An amplified product can thus be scanned for many possible sequence variants by determining 
its hybridization pattern to an array of immobilized oligonucleotide probes. However, establishing conditions that dis- 
tinguish a number of other strategies for nucleotide sequence distinction all depend on enzymes to identify sequence 
differences (Saiki, PNAS USA 56:6230. 1989; Zhang. NucL Acids Res, t9:3929. 1991). 

[0312] For example, restriction enzymes recognize sequences of about 4-8 nucleotides. Based on an average G+C 
30 content, approximately half of the nucleotide positions in a DNA segment can be monitored with a panel of 1 GO restriction 
enzymes. As an alternative, artificial restriction enzyme recognition sequences may be created around a variable po- 
sition by using partially mismatched PGR primers. With this technique, either the mutant or the wild-type sequence 
alone may be recognized and cleaved by a restriction enzyme after amplification (Chen et al., Anal. Biochem. f 95:51 , 
1991; Levi et al.. Cancer Res. 5t:3497, 1991). Another method exploits the property that an oligonucleotide primer 
35 that is mismatched to a target sequence at the 3' penultimate position exhibits a reduced capacity to serve as a primer 
in PGR. However, some 3' mismatches, notably G-T, are less inhibitory than others limiting its usefulness are. In at- 
tempts to improve this technique, additional mismatches are incorporated into the primer at the third position from the 
3* end. This results in two mismatched positions in the three 3' nucleotides of the primer hybridizing with one allelic 
variant, and one mismatch in the third position in from the 3* end when the primer hybridizes to the other allelic variant 
40 (Newton et al., Nucl. Acids Res. 1 7:2503, 1989), It is necessary to define amplification conditions that significantly 
favor amplification of a 1 bp mismatch. 

[0313] DNA polymerases have also been used to distinguish allelic sequence variants by determining which nucle- 
otide is added to an oligonucleotide primer immediately upstream of a variable position in the target strand. 
[0314] A ligation assay has been developed. In this method, two oligonucleotide probes hybridizing in immediate 
45 juxtaposition on a target strand are joined by a DNA ligase. Ligation is inhibited if there is a mismatch where the two 
oligonucleotide probes abut. 

14. Assays for Mutation Detection 

50 [0315] Mutations are a single-base pair change in genomic DNA. Within the context of this invention, most such 
changes are readily detected by hybridization with oligonucleotides that are complementary to the sequence in ques- 
tion. In the system described here, two oligonucleotides are employed to detect a mutation. One oligonucleotide pos- 
sesses the wild-type sequence and the other oligonucleotide possesses the mutant sequence. When the two oligonu- 
cleotides are used as probes on a wild-type target genomic sequence, the wild-type oligonucleotide will form a perfectly 

55 based paired structure and the mutant oligonucleotide sequence will form a duplex with a single base pair mismatch. 
As discussed above, a 6 to 7*'C difference in the Tm of a wild type versus mismatched duplex permits the ready 
identification or discrimination of the two types of duplexes. To effect this discrimination, hybridization is performed at 
the Tm of the mismatched duplex in the respective hybotropic solution (see. e.g., U.S. Application Nos. 60/026,621 
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(filed September 24, 1 996); 08/71 9. 1 32 (filed September 24, 1 996); 08/933,924 (filed September 23, 1 997); 09/002,051 
(filed December 31 , 1997); and International Publication No. WO 98/13527, all of which are incorporated herein in their 
entireties, for a description of hybotropic solutions). The extent of hybridization is then measured for the set of oligo- 
nucleotide probes. When the ratio of the extent of hybridization of the wild-type probe to the mismatched probe is 
5 measured, a value to 10/1 to greater than 20/1 is obtained. These types of results permit the development of robust 
assays for mutation detection. 

[031 6] For exemplary purpbsesT o"ne~as~say format for mutation detection utilizes target nucleic acid (e.g. , genomic 
DNA) and oligonucleotide probes that span the area of interest. The oligonucleotide probes are greater or equal to 24 
nt in length (with a maximum of about 36 nt) and labeled with a fluorochrome at the 3' or 5' end of the oligonucleotide 

10 probe. The target nucleic acid is obtained via the lysis of tissue culture cells, tissues, organisms, etc., in the respective 
hybridization solution. The lysed solution is then heated to a temperature that denatures the target nucleic acid ( 1 5-25''C 
above the Tm of the target nudeic acid duplex). The oligonucleotide probes are added at the denaturation temperature, 
and hybridization is conducted at the Tm of the mismatched duplex for 0.5 to 24 hours. The genomic DNA is then 
collected and by passage through a GF/C (GF/B, and the like) glass fiber filter. The filter is then washed with the 

15 respective hybridization solution to remove any non-hybridized oligonucleotide probes (RNA, short oligos and nucleic 
acid does not bind to glass fiber filters under these conditions). The hybridization oligo probe can then be thermally 
eluted from the target DNA and measured (by fluorescence for example). For assays requiring very high levels of 
sensitivity, the probes are concentrated and measured. 

[0317] Other highly sensitive hybridization protocols may be used. The methods of the present invention enable one 

20 to readily assay for a nucleic acid containing a mutation suspected of being present in cells, samples, etc., /.e.. a target 
nucleic acid. The target nucleic acid contains the nucleotide sequence of deoxyribonucleic acid (DNA) or ribonucleic 
acid (RNA) whose presence is of interest, and whose presence or absence Is to be detected for in the hybridization 
assay. The hybridization methods of the present Invention may also be applied to a complex biological mixture of 
nudeic acid (RNA and/or DNA). Such a complex biological mixture indudes a wide range of eucaryotic and procaryotic 

25 cells, induding protoplasts; and/or other biological materials which harbor polynucleotide nucleic acid. The method is 
thus applicable to tissue culture cells, animal cells, animal tissue, blood cells (e.g., reticulocytes, lymphocytes), plant 
cells, bacteria, yeasts, vimses, mycoplasmas, protozoa, fungi and the like. By detecting a specific hybridization between 
nucleic acid probes of a known source, the specific presence of a target nudeic acid can be established. A typical 
hybridization assay protocol for detecting a target nucleic add in a complex population of nudeic acids is described 

30 as follows: Target nudeic acids are separated by size on an LC or HPLC, cloned and isolated, sub-divided into pools, 
or left as a complex population. Within one embodiment of the invention methods are provided for determining the 
identity of a nuderc acid molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, 
utilizing the general techniques of hybridization assays. Briefly, such methods generally comprise the steps of target 
nucleic acids being doned and isolated, sub-divided into pools, or left as a complex population. The target nudeic 

35 acids are hybridized with tagged oligonucleotide probes under conditions described above. The target nucleic acids 
are separated according to size by LC or HPLC. The tags are deaved from the separated fragments, and then the 
tags are detected by the respective detection technology (e.g., mass spectrometry, infrared spectrometry, potentiostatic 
amperometry or UV/vislble spectrophotometry). 

^0 15. Sequencing by hybridization 

[0318] DNA sequence analysis is conventionally perfomied by hybridizing a primer to target DNA and performing 
chain extensions using a polymerase. Spedflc stops are controlled by the indusion of a dideoxynudeotide. The spe- 
dflcity of priming in this type of analysis can be Inaeased by induding a hybotrope in the annealing buffer and/or 

^5 incorporating an abasic residue in the primer and annealing at a discriminating temperature. 

[0319] Within one emlwdiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic add molecule, in, for example a biological sample, utilizing the general 
techniques of sequencing by hybridization using the Sanger method. Briefly, such methods generally comprise the 
steps of hybridizing a tagged primer to target DNA and performing chain extensions using a polymerase. Speciflc stops 

50 are controlled by the indusion of a dideoxynudeotide that may also be tagged. The target nudeic adds are separated 
according to size by HPLC. The tags are deaved from the separated fragments, and are detected by the respective 
detection technology (e.g., mass spectrometry, infrared spectrometry, potentiostatic amperometry or UV/visible spec- 
trophotometry). Other sequence analysis methods involve hybridization of the target with an assortment of random, 
short oligonudeotides. The sequence is constructed by overiap hybridization analysis. In this technique, precise hy- 

S5 bridization is essential. Use of hybotropes or abasic residues and annealing at a discriminating temperature is beneficial 
for this technique to reduce or eliminate mismatched hybridization. The goal is to develop automated hybridization 
methods in order to probe large arrays of oligonucleotide probes or large arrays of nucleic acid samples. Applications 
of such technologies Indude gene mapping, done characterization, medical genetics and gene discovery, DNA se- 
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quence analysis by hybridization, and finally, sequencing verification: Many parameters must be controlled in order to 
automate or multiplex oligonucleotide probes. The stability of the respective probes must be similar, the degree of 
mismatch with the target nucleic acid, the temperature, ionic strength, the A+T content of the probe (or target), as well 
as other parameters when the probe is short [i.e., 6 to 50 nucleotides) should be similar. Usually, the conditions of the 

5 experiment and the sequence of the probe are adjusted until the formation of the perfectly based-paired probe is 
thermodynamically favored over the any duplex that contains a mismatch. Very largescale applications of probes such 
as sequencing by hybridization (SBH), or testing highly polymorphic loci such as the cystic fibrosis transHnembrane 
protein locus require a more stringent level of control of multiplexed probes. Within one embodiment of the invention 
methods are provided for determining the identity of a nucleic acid molecule, or for detecting a selecting nucleic acid 

10 molecule, in, for example a biological sample, utilizing the general techniques of sequencing by hybridization. Briefly, 
such methods generally comprise of hybridizing a series of tagged primers to a DNA target or a series of target DNA 
fragments under carefully controlled conditions. The target nucleic acids are separated according to size by HPLC. 
The tags are then cleaved from the separated fragments, and detected by the respective detection technology (e.g.. 
mass spectrometry, infrared spectrometry, potentiostatic amperometry or UV/visible spectrophotometry). 

15 

16. Oligonucleotide-Ligation Assay 

[0320] Oligonucleotide-ligation assay is an extension of PCR-based screening that uses an ELISA-based assay 
(01^. Nickerson et al., Proc. Natl. Acad. Sci. USA 87:8923, 1990) to detect the PGR products that contain the target 

20 sequence. Thus, both gel electrophoresis and colony hybridization are eliminated. Briefly, the OLA employs two adja- 
cent oligonucleotides: a "reporter" probe (tagged at the 5' end) and a 5'-phosphorylated/3'-biotinylated "anchor" probe. 
The two oligonucleotides, which are complementary to sequences intemal to the PGR primers, are annealed to target 
DNA and. if there is perfect complementarity, the two probes are ligated by T4 DNA ligase. Capture of the biotlnylated 
anchor probe on immobilized streptavidin and analysis for the covalently linked reporter probe test for the presence 

25 or absence of the target sequences among the PGR products. Within one embodiment of the invention methods are 
provided for determining the identity of a nucleic acid molecule, or for detecting a selecting nucleic acid molecule, in. 
for example a biological sample, utilizing the technique of oligonucleotide ligation assay. Briefly, such methods generally 
comprise the steps of performing PGR on the target DNA followed by hybridization with the 5' tagged Ireporterf DNA 
probe and a 5' phosphorylated/non-biotinylated probe. The sample is incubated with T4 DNA ligase. The DNA strands 

30 with ligated probes can be separated from the DNA with non-ligated probes by, for example, preferably by LG or HPLG. 
The tags are cleaved from the separated fragments, and then the tags are detected by the respective detection tech- 
nology (e.g., mass spectrometry, infrared spectrophotometry, potentiostatic amperometry or UV/visible spectropho- 
tometry. Recent advances In the OLA assay have allowed for the analysis multiple samples and multiple mutations 
concun-ently. (Baron et al.. Nature Biotechnology 87:1279, 1996.) Briefly, the method consists of amplifying the gene 

35 fragment containing the mutation of interest with PGR. The PGR product is then hybridized with a common and two 
allele-specific oligonucleotide probes (one containing the mutation while the other does not) such that the 3* ends of 
the allele-specific probes are immediately adjacent to the 5' end of the common probe. This sets up a competitive 
hybridization-ligation process between the two allelic probes and the common probe at each locus. The thermostable 
DNA ligase then discriminates between single-base mismatches at the junction site, thereby producing allele-specific 

40 ligation products. The common probe is labeled with one of four fluorophores and the allele-specific probes are each 
labeled with one or more pentaethyleneoxide mobility modifying tails which provide a sizing difference between the 
different allele-specific probes. The samples are then separated by gel electrophoresis based upon the length of the 
modifying tails and detected by the fluorescent tag on the common probe. Through the use in sizing differences on the 
allele-specific probes and four fluorophores available for the common probe, many samples can be analyzed on one 

45 lane of the electrophoretic gel. Within one embodiment of the invention methods are provided for detenmining the 
identity of a nucleic acid molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, 
utilizing the technique of oligonucleotide ligation assay for concurrent multiple sample detection. Briefly, such methods 
generally comprise the steps of performing PGR on the target DNA followed by hybridization with the common probe 
(untagged) and two allele-specific probes tagged according to the specifications of the invention. The sample is incu- 

50 bated with DNA ligase and fragments separated by, for example, preferably by LG or HPLG. The tags are cleaved from 
the separated fragments, and then the tags are detected by the respective detection technology (e.g., mass spectrom- 
etry, infrared spectrophotometry, potentiostatic amperometry or UV/visible spectrophotometry, 

17. Differential Display 

55 

a. Overview 

[0321] Mammals, such as human beings, have about 100,000 different genes in their genome, of which only a small 
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fraction, perhaps 15%, are expressed in any individual celL The choice of genes expressed detennine the biochemical 
character of any given cell or tissue. The process of normal cellular growth and differentiation, as well as the pathological 
changes that arise in diseases like cancer, are all driven by changes in gene expression. Differential display methods 
permits the identification of genes specifically expressed in individual cell types. 

5 [0322] The differential display technique amplifies the 3' terminal portions of corresponding cDNAs by using a primer 
designed to bind to the 5' boundary of a poly(A) tail and primers of arbitrary sequence that bind upstream. Amplified 

' ~ populations with each primer pair are visualized by a size separation method (PAGE. HPLC, etc.), allowing direct 

comparison of the mRNAs between two biological samples of interest. The differential display method has the potential 
to visualize ail the expressed genes (about 1 0,000 to 1 5,000 mRNA species) in a mammalian cell and enables sequence 

10 analysis. It is possible to compare: (1 ) the total number of peaks amplified in the parents, (2) the number of polymorphic 
peaks between parents, and (3) the segregation ratios of polymorphic peaks in the progeny of crosses in animals or 
plants. Differential display is also used for the identification of up- and down-regulated genes, known or unknown, after 
a variety of stimuli. Differential display PCR fragments can be used as probes for cDNA cloning (discovering an unknown 
gene from a cDNA or genomic library). 

15 [0323] Briefly, the steps in differential display are as follows: 1) RNA is isolated from biological sample of interest. 
Total RNA, cytoplasmic RNA or mRNA can be used. 2) first strand cDNAs are generated using an anchored oligodT 
(oligodTdN. where N is A, C, or G). 3) Amplification of cDNA using oligodTdN and short primers with arbitrary sequence. 
For a complete differential display analysis of two cell populations or two samples of interest. 9 different primers iare 
required. The detection limit of differential display for a specific mRNA is less than 0.00 1 % of the total mRNA population. 

20 [0324] Because of the simplicity, sensitivity, and reproducibility of the method disclosed here, the CMST differential 
display method is a significant advance over traditional gel based systems. With the CMST-based differential display 
analysis of two cell types, including 64 x 24 PCR runs can be completed rapidly as opposed to a labor intensive, lengthy 
time by the traditional method. Moreover, sequence heterogeneity of bands isolated from differential display gels has 
been found to be a contributing factor to the high failure rate of this technique, this is completely avoided with the 

25 CMST-based differential display methology described here. 

b. CMST-based Differential Display Example: 

[0325] The starting material for differential display is RNA isolated from two different populations of cells. Generally, 
30 the cells are of similar origin, and differ with respect to their treatment with drugs, their being ''normal" versus "trans- 
formed", or their expression of various introduced genes. 

[0326] Plant tissue or animal material (2-3 g) is harvested and minced or chopped in sterile petri dishes. The material 
Is then ground io a fine powder in a precooled pestle and mortar under liquid N2. 1 g of frozen powder is transferred 
to a 12 ml poly-propelene tubes (1 g = ca. 5 ml powder) containing 8 ml of a hot (80°C) 1 :1 mixture of RNA extraction 

35 buffer (100 mM Tris-HCI (pH 8.0), 100 mM LiCI. 10 mM EDTA. 1.0% LiDS) and phenol (base vol.: 4 ml). The sample 
is mixed (vortexed) at high speed for at least 30 seconds. A volume of chloroform is added (4 ml) and again mixed and 
spun for 20 mins in a centrifuge at 5000 to 10,000 x g. The aqueous phase is transfen-ed to a new 12 ml tube and 1/3 
volume of cold 8M LiCI is added to precipitate the RNA (3 h at 0°C). The RNA is centrifuged at 0°C for 20 mins and 
resuspended in I ml H2O. Residual genomic DNA is removed by treating the RNA sample with DNase 1. Reverse 

40 transcription is performed on each RNA sample using 500 ng of DNA free RNA in 1x reverse transcription buffer. 10 
mM DTT, 20 ^iM dNTPs. 0.2 jiM 5'RS H-T11C (one base anchored primer with 5* restriction site), 200 U MoMuLV 
reverse transcriptase and 1 .5U RNA Guard per 20 pJ reaction volume. 

[0327] Although using a downstream primer reduces the number of cDNA subtractions to three, it does not reduce 
the number of PCR reactions required to display most of the cDNA species present in the pool. On the contrary, It 
<5 decreases the theoretical chance of identifying those cDNA species which are present. The best results are obtained 

using a combination of nine different primers of the type DMO-W, where V can be A,G,C but not T. With a T in the 
terminal 3' position, incomplete hybridization of the primer leads to smearing of bands on the gels. The optimal con- 
centration of RNA is 200-300 ng per cDNA synthesis. 

[0328] CMST-based differential display is performed essentially as previously described (Liang and Pardee. Science 
50 257:967-971. 1992) except for the design of primers used for reverse transcription and amplification steps, and the 

choice of radiolabeled nucleotide. A complete differential display analysis of the cDNAs from two biological samples 

of interest using nine downstream primers and 24 upstream primers would generate 9 x 24 x 2 CMST-based differential 

display reactions Amplificafion products can be separated by HPLC and reamplified if desired. 

[0329] Following incubation of the RNA at 65°C for 5 minutes, samples are chilled on ice, added to the reverse 
55 transcription mix and Incubated for 60 minutes at 37*C followed by 95*C for 5 minutes. Duplicate cDNA samples are 

then amplified using the same 5'- primer in combination with a series of 1 3mers of arbitrary but defined sequence: 

H-AP: AAGCTTCGACTGT. H-AP: AAGCTTTGGTCAG. H-AP4: AAGCTTCTCAACG, H-AP5: AAGCTTAGTAGGC. 

[0330] Amplificatk)n is perfomied in reaction mixes containing 0. 1 x volume of reverse transcription reaction, 1 x PCR 
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buffer (10 x PCR buffer = 100 mM TRIS-HCI. 15 mM MgC12, 10 mM KCI pH 8.3) 2 pM dNTPs. 0.2^iM RS H-T11C 
anctiored primer, 0.2^M appropriate arbitrary primer, 1.5 U Expand™ high fidelity DNA polymerase, and water to a 
final volume of 20^l1. Amplification of the cDNA is performed under the following conditions: 94°C (1 minute) followed 
by 40 cycles of 94°C (30 seconds). 40°C (2 minutes), 72°C (30 seconds) and finished with 72°C (5 minutes). 

5 [0331] Amplification for each gene is performed with gene specific primers spanning a known intron/exon boundry 
(see below). All amplifications are done in 20 \i\ volumes containing 1 0 mM Tris HQ pH 8.3, 1 mM NH4CI. 1 .5 M MgC12, 
1 00 mM KCI, 0.125 mM NTP~s. 10 ng/mfof the respective dligbnudeotide primers and 0^75 units of Tag DNA polymerase 
(Gibco-BRL). Cycling parameters were 94°C preheating step for 5 minutes followed by 94°C denaturing step for I 
minute, SS^'C annealing step for 2 minutes, and a 72''C extension step for 30 seconds to 1 minute and a final extension 

10 at 72''C for 10 minutes. Amplification cycles are generally 30-45 in number. 

[0332] Amplification products are gel purified (Zhen and Swank. BioTechniques t4;:894-898. 1993) on 1% agarose 
gels run in 0.04 M Tris-acetate, 0.001 M EDTA (1x TEA) buffer and stained with ethidium bromide. A trough is cut just 
in front of the band of interest and filled with 50-200 ^1 of 10% PEG in 1x TAE buffer. Electrophoresis is continued until 
the band has completely entered the trough. The contents are then removed and extracted with phenol, cholorfonm 

15 extracted, and precipitated in 0.1 volume of 7.5 M ammonium acetate and 2.5 volumes of 100% EtOH. Samples are 
washed with 75% EtOH and briefly dried at ambient temperature. Quantitation of yield is done by electrophoresis of a 
small aliquot on 1% agarose gel in 1x TBE buffer with ethidium bromide staining and comparison to a known standard. 
[0333] The products from the amplification reactions are analyzed by HPLC. HPLC ias earned out using automated 
HPLC instrumentation (Rainin, Emeryville, CA., or Hewlett Packard, Palo Alto, CA). Unpurified DNA fingerprinting 

20 products which are denatured for 3 minutes at 95 prior into injection into an HPLC are eluted with linear acetonitrile 
(ACN. J.T. Baker, NJ) gradient of 1 .8%/minute at a flow rate of 0.9 ml/minute. The start and end points are adjusted 
according to the size of the amplified products. The temperature required for the successful resolution of the molecules 
generated during the DNA fingerprinting technique is 50**C. The effluent from the HPLC is then directed Into a mass 
spectrometer (Hewlett Packard, Palo Alto, CA) for the detection of tags. 

25 [0334] Comparison of the chromatograms (mass spectrometry-based) indicates that bands at 220 bp and 468 bp 
are observed in the stimulated Jurkat cells and not observed in the unstimulated Jurkat ceils. 

C. SEPARATION OF NUCLEIC ACID FRAGMENTS 

30 [0335] A sample that requires analysis is often a mixture of many components in a complex matrix. For samples 
containing unknown compounds, the components must be separated from each other so that each individual compo- 
nent can be identified by other analytical methods. The separation properties of the components in a mixture are 
constant under constant conditions, and therefore once determined they can be used to identify and quantify each of 
the components. Such procedures are typical In chromatographic and electrophoretic analytical separations. 

35 

1 High-Performance Liquid Chromatography (HPLC) 

[0336] High-performance liqukl chromatography (HPLC) is a chromatographic separations technique to separate 
compounds that are dissolved in solution. HPLC instruments consist of a resen^oir of mobile phase, a pump, an injector, 
40 a separation column, and a detector. Compounds are separated by injecting an aliquot of the sample mixture onto the 
column. The different components in the mixture pass through the column at different rates due to differences in their 
partitioning behavior between the mobile liquid phase and the stationary phase. 

[0337] Recently, IP-RO-HPLC on non-porous PS/DVB particles with chemically bonded alkyl chains have been 
shown to be rapid alternatives to capillary electrophoresis in the analysis of both single and double-strand nucleic acids 

45 providing similair degrees of resolution (Huber et al. Anal.Biochem. 212:351 , 1 993; Huber et al., 1 993. Nuc. Acids Res. 
2f:1061; Huber et al., Biotechniques 76:898, 1993). In contrast to ion-excahnge chromoatrography. which does not 
always retain double-strand DNA as a function of strand length (Since AT base pairs intereact with the positively charged 
stationary phase, more strongly than GC base-pairs). IP-RP-HPLC enables a strictly size-dependent separation. 
[0338] A method has been developed using 100 mM triethylammonium acetate as ion-pairing reagent, phosphodi- 

50 ester oligonucleotides could be successfully separated on alkylated non-porous 2.3 ^iM poly(styrene-divinylbenzene) 
particles by means of high performance liquid chromatography (Oefner et al., Anal. Biochem. 223:39, 1994). The 
technique described allowed the separation of PCR products differing only 4 to 8 base pairs in length within a size 
range of 50 to 200 nucleotides. 

55 2. Electrophoresis 

[0339] Electrophoresis is a separations technique that is based on the mobility of ions (or DNA as is the case de- 
scribed herein) in an electric field. Negatively charged DNA charged migrate towards a positive electrode and positively- 
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charged ions migrate toward a negative electrode. For safety reasons one electrode is usually at ground and the other 

is biased positively or negatively. Charged species have different migration rates depending on their total charge, size, 
and shape, and can therefore be separated. An electrode apparatus consists of a high-voltage power supply, electrodes, 
buffer, and a support for the buffer such as a polyacrylamide gel, or a capillary tube. Open capillary tubes are used for 
5 many types of samples and the other gel supports are usually used for biological samples such as protein mixtures or 
DMA fragments. 

3. Capillary Electrophoresis (CE) 

10 [0340] Capillary electrophoresis (CE) in its various manifestations (free solution, isotachophoresis, isoelectric focus- 
ing, polyacrylamide gel. micellar electrokinetic "chromatography") is developing as a method for rapid high resolution 
separations of very small sample volumes of complex mixtures. In combination with the inherent sensitivity and selec- 
tivity of MS. CE-MS is a potential powerful technique for bioanalysis. In the novel application disclosed herein, the 
interfacing of these two methods will lead to superior DNA sequencing methods that eclipse the cuaent rate methods 

IS of sequencing by several orders of magnitude. 

[0341] The correspondence between CE and electrospray ionization (ESI) flow rates and the fact that both are fa- 
cilitated by (and primarily used for) ionic species in solution provide the basis for an extremely attractive combination. 
The combination of both capillary zone electrophoresis (C2E) and capillary isotachophoresis with quadrapole mass 
spectrometers based upon ESI have been described (Olivares et al., Anal. Chem. 59:1230, 1987; Smith et al„ AnaL 

20 Chem. 60:436, 1988; Loo et al.. Anal. Chem. 179:404, 1989; Edmonds et al.. J. Chroma. 474:21 , 1989; Loo et al., J. 
Microcolumn Sep. f:223. 1989; Lee et al.. J. Chromatog. 45d:313. 1988; Smith et al.. J. Chromatog. 480:211, 1989; 
Grese et al.. J. Am. Chem. Soc. 111:2635, 1989). Small peptides are easily amenable to CZE analysts with good 
(femtomole) sensitivity. 

[0342] The most powerful separation method for DNA fragments is polyacrylamide gel electrophoresis (PAGE), gen- 
25 eraily in a slab gel format. However, the major limitation of the current technology is the relatively long time required 
to perfomi the gel electrophoresis of DNA firagments produced in the sequencing reactions. An increase magnitude 
(10-fbld) can be achieved with the use of capillary electrophoresis which utilize ultrathin gels. In firee solution to a first 
approximation all DNA migrate with the same mobility as the addition of a base results in the compensation of mass 
and charge. In polyacrylamide gels, DNA fragments sieve and migrate as a function of length and this approach has 
30 now been applied to CE, Remarkable plate number per meter has now been achieved with cross-linked: polyacrylamide 
(10"^ plates per meter, Cohen et al., Proc. Natl. Acad. Sc/„ USA 85:9660. 1988). Such CE columns as described can 
be employed for DNA sequencing. The method of CE Is in principle 25 times faster than slab gel electrophoresis in a 
standard sequencer. For example, about 300 bases can be read per hour. The separation speed is limited in slab gel 
electrophoresis by the magnitude of the electric field which can be applied to the gel without excessive heat production. 
35 Therefore, the greater speed of CE is achieved through the use of higher field strengths (300 V/cm in CE versus 10 
V/cm in slab gel electrophoresis). The capillary format reduces the amperage and thus power and the resultant heat 
generation. 

[0343] Smith and others (Smith etal., Nuc. Acids. Res. 1 8:441 7, 1990) have suggested employing multiple capillaries 
in parallel to increase throughput. Likewise, Mathies and Huang (Mathies and Huang, Nature 359:167, 1992) have 

^0 introduced capillary electrophoresis in which separations are performed on a parallel an-ay of capillaries and demon- 
strated high through-put sequencing (Huang et al.. Anal. Chem. 64:967. 1992, Huang et al.. Anal. Chem. 64:2149, 
1992). The major disadvantage of capillary electrophoresis is the limited amount of sample that can be loaded onto 
the capillary. By concentrating a large amount of sample at the beginning of the capillary, prbr to separation, loadability 
is increased, and detection levels can be lowered several orders of magnitude. The most popular method of precon- 

45 centration in CE is sample stacking. Sample stacking has recently been reviewed (Chien and Burgi, Anal. Chem. 64: 
489A, 1992). Sample stacking depends of the matrix difference, (pH, ionic strength) between the sample buffer and 
the capillary buffer, so that the electric field across the sample zone is more than in the capillary region. In sample 
stacking, a large volume of sample In a low concentration buffer is introduced for preconcentration at the head of the 
capillary column. The capillary is filled with a buffer of the same composition, but at higher concentration. When the 

50 sample ions reach the capillary buffer and the lower electric field, they stack into a concentrated zone. Sample stacking 
has increased detectabilities 1-3 orders of magnitude. 

[0344] Another method of preconcentration is to apply isotachophoresis (ITP) prior to the free zone CE separation 
of analytes. ITP is an electrophoretic technk^ue which allows microliter volumes of sample to be loaded on to the 
capillary, in contrast to the low nL injection volumes typically associated with CE. The technique relies on inserting the 
55 sample between two buffers (leading and trailing electrolytes) of higher and lower mobility respectively, than the analyte. 
The technique is inherently a concentration technique, where the analytes concentrate into pure zones migrating with 
the same speed. The technique is currently less popular than the stacking methods desaibed above because of the 
need for several choices of leading and trailing electrolytes, and the ability to separate only cationic or anionic spedes 



60 



EP 0 990 047 B1 



during a separation process. 

[0345] The heart of the DNA sequencing process is the remarkably selective electrophoretic separation of DNA or 
oligonucleotide fragments. It is remarkable because each fragment is resolved and differs by only nucleotide. Sepa- 
rations of up to 1000 fragments (1000 bp) have been obtained. A further advantage of sequencing with deavable tags 

s is as follows. There is no requirement to use a slab gel format when DNA fragments are separated by polyacrylamide 
gel electrophoresis when deavable tags are employed. Since numerous samples are combined (4 to 2000) there is 
no need to run' samples in parallel as is the case with cun-ent dyei}rimer or dye-terminator methods-f/.e., ABI373 
sequencer). Since there is no reason to run parallel lanes, there is no reason to use a slab gel. Therefore, one can 
employ a tube gel format for the electrophoretic separation method. Grossman (Grossman et al., Genet. Anal. Tech. 

10 Appl. 9:9, 1992) have shown that considerable advantage is gained when a tube gel format is used in place of a slab 
gel format. This is due to the greater ability to dissipate Joule heat in a tube format compared to a slab gel which results 
in faster run times (by 50%), and much higher resolution of high molecular weight DNA fragments (greater than 1000 
nt). Long reads are critical in genomic sequendng. Therefore, the use of deavable tags in sequencing has the additional 
advantage of allowing the user to employ the most efficient and sensitive DNA separation method which also possesses 

^5 the highest resolution. 

4. Microfabricated Devices 

[0346] Capillary electrophoresis (CE) is a powerful method for DNA sequencing, forensic analysis, PGR product 

20 analysis and restriction fragment sizing. CE is far faster than traditional slab PAGE since with capillary gels a far higher 
potential field can be applied. However. CE has the drawback of allowing only one sample to be processed per gel. 
The method combines the faster separations times of CE with the ability to analyze multiple samples in parallel. The 
underlying concept behind the use of microfabricated devices is the ability to increase the information density in elec- 
trophoresis by miniaturizing the lane dimension to alx)ut 100 micrometers. The electronics industry routinely uses 

25 microfabrication to make circuits with features of less than one micron in size. The current density of capillary arrays 
is limited the outside diameter of the capillary tube. Microfabrication of channels produces a higher density of arrays. 
Microfabrication also permits physical assemblies not possible with glass fibers and links the channels directly to other 
devices on a chip. Few devices have been constructed on microchips for separation technologies. A gas chromatograph 
(Terry et al., IEEE Trans. Electron Device. EO-26: 1880, 1979) and a liquid chromatograph (Manz et al., Sens. Actuators 

30 8f :249. 1990) have been fabricated on silicon chips, but these devices have not been widely used. Several groups 
have reported separating fluorescent dyes and amino acids on microfabricated devices (Manz et al,, J. Chromatography 
593:253. 1 992, Effenhauser et al.. Anal. Chem. 65:2637, 1 993). Recently Woolley and Mathies (Woolley and Mathies, 
Proc. Natl. Acad. Sci. 9f:11348, 1994) have shown that photolithography and chemical etching can be.used to make 
large numbers of separation channels on glass substrates. The channels are filled with hydroxyethyl cellulose (HEC) 

35 separation matrices. It was shown that DNA restriction fragments could be separated in as little as two minutes. 

D. CLEAVAGE OF TAGS 

[0347] As described above, different linker designs will confer cleavabllity ("lability") under different spedfic physical 
40 or chemical conditions. Examples of conditions which serve to deave various designs of linker Include acki, base, 
oxidation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

[0348] Examples of deavable tinkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and indude those found in the catalog available from Pierce (Rockford, IL). Examples indude: 

45 • ethylene glycobis(succinimidylsuccinate) (EGS). an amine reactive cross-linking reagent which Is deavable by 
hydroxylamine (1 M at 37°C for 3-6 hours); 

• disuccinimidyl tartarate (DST) and sulfo-DST, which are amine reactive aoss-linking reagents, deavable by 0!01 5 
M sodium periodate; 

• bis[2-(succinlmidyloxycarbonyloxy)ethyl]sulfone (BSOCOES) and sulfo-BSOCOES, which are amine reactive 
50 cross-linking reagents, deavable by base (pH 11.6): 

• 1,4-di-(3'-(2'-pyridyldithio(propionamido))butane (DPDPB), a pyridyldithiol crosslinker which is deavable by thiol 
exchange or reduction; 

N-(4-(p-azidosaIicylamido)-butyl]-3*-(2'-pyridydithio)propionamide (APDP), a pyridyldithiol crosslinker which is 
deavable by thiol exchange or reduction; 
55 • bis-(beta-4-(azldosalicylamido)ethyl]-disulfide, a photoreactive crosslinker which is deavable by thk)l exchange or 
reduction; 

• N-succinimidyi-(4-azidophenyl)-1 .3'dithiopropionate (SADP), a photoreactive crosslinker vi/hich is deavable by thi- 
ol exchange or reduction; 
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• suIfosuccinimidyl-2-(7-azidcHi-melhylcoumarin-3-acetamide)ethyl-1,3'-dithi (SAED), a photoreactive 
crosslinker which is cleavable by thiol exchange or reduction; 

• sulfosuccinimidyl-2-(m-azido-o-nitrobenzamido)-ethyl-1,3'dithiopropionate (SAND), a photoreactive crosslinker 
which is cleavable by thiol exchange or reductk^n. 

5 

[0349] Other examples of j^ieavable linkers and the cleavage conditions that can be used to release tags are as 
follows. A silyl linking group can be cleaved by fluoride or under acidic condiSons. A 3-74-, 5-, or 6-substituted-2-ni- 
trobenzyloxy or 2-. 3-, 5-, or 6-substituted-4-nitrobenzyloxy linking group can be cleaved by a photon source (photol- 
ysis). A 3-. 4-. 5-, or 6-substituted-2-alkoxyphenoxy or 2-. 3-, 5-, or 6-substituted-4-alkoxyphenoxy linking group can 

10 be cleaved by Ce(NH4)2{N03)6 (oxidation). A NCO2 (urethane) linker can be deaved by hydroxide (base), acid, or 
LiAIH4 (reduction). A 3-pentenyi, 2-butenyl. or 1-butenyl linking group can be cleaved by O3. O5O4/IO4-. or KMn04 
(oxidation), A 2-[3-, 4-, or 5-substituted-furyl]oxy linking group can be cleaved by O2, Br2. MeOH, or acid. 
[0350] Conditions for the cleavage of other labile linking groups include: t-alkyloxy linking groups can be cleaved by 
acid; methyl(dialkyl)methoxy or 4-substituted-2-alkyl-1,3-dioxlane-2-yl linking groups can be cleaved by H3O*; 2-sl- 

15 lylethoxy linking groups can be cleaved by fluorkJe or acid; 2-(X)-ethoxy (where X = keto, ester amide, cyano, NO2, 
sulfide, sulfoxide, sulfone) linking groups can be dedved under alkaline conditions; 2-, 3-, 4-. 5-, or 6-substituted- 
benzyloxy linking groups can be cleaved by acid or under reductive conditions; 2-butenyloxy linking groups can be 
deaved by (Ph3P)3RhCI(H). 3-. 4-, 5-, or 6-substituted-2-bromophenoxy linking groups can be deaved by Li, Mg, or 
BuLi; methylthiomethoxy linking groups can be deaved by Hg2-; 2-{X)-ethyioxy (where X = a halogen) linking groups 

20 can be cleaved by Zn or Mg; 2-hydroxyethyloxy linking groups can be deaved by oxidation (e.g., with Pb(OAc)4), 
[0351] Prefen-ed linkers are those that are cleaved by acid or photolysis. Several of the acid-labile linkers that have 
been developed for solid phase peptide synthesis are useful for linking tags to MOIs, Some of these linkers are de- 
scribed in a recent review by Uoyd-Williams et al. (Tetrahedron 49:11065-11133, 1993). One useful type of linker is 
based upgn p-alkoxybenzyl alcohols, of which two, 4-hydroxymethylphenoxyacetic acid and 4-(4-hydroxymethyl- 

25 3-methoxyphenoxy)butyric acid, are commercially available from Advanced ChemTech (Louisville, KY), Both linkers 
can be attached to a tag via an ester linkage to the benzylalcohol, and to an amine-containing MOt via an amide linkage 
to the carboxylic add. Tags linked by these molecules are released from the MOI with varying concentrations of trif- 
luoroacetic acid. The deavage of these linkers results in the liberation of a carboxylic acid on the tag. Acid cleavage 
of tags attached through related linkers, such as 2,4-dimethoxy-4 -(cartx)xymethyloxy)-benzhydrylamine (available 

30 from Advanced ChemTech in FMOC-protected form), results In liberation of a carboxylic amide on the released tag. 
[0352] The photolabile linkers useful for this application have also been for the most part developed for solid phase 
peptide synthesis (see Lloyd-Williams review). These linkers are usually based on 2-nitrobenzylesters or 2-nitroben- 
zylamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1-Fmoc- 
amino)ethyl)-2-methoxy-5-nitrophenoxy)butanoic acid (Holmes and Jones, J. Org. Chem. 60:2318-2319, 1995) and 

35 3-(Fmoc-amino)-3-(2-nitrophenyl)propionic acid (Brown et al,. Molecular Diversity f :4-12, 1 995). Both linkers can be 
attached via the carboxylic acid to an amine on the MOI. The attachment of the tag to the linker is made by forming 
an amide between a cart>oxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usually 
perfonned with UV light of 350 nm wavelength at Intensities and times known to those in the art. Exam pies of commercial 
sources of instruments for photochemical deavage are Aura Industries Inc. (Staten Island, NY) and Agrenetics (Wilm- 

40 ington, MA). Cleavage of the linkers results in liberation of a primary amide on the tag. Examples of photocleavable 
linkers indude nitrophenyl glycine esters, exo- and endo-2-benzonort3omeyl chlorides and methane sulfonates, and 
3-amino-3(2-nitrophenyl) propionic ackJ. Examples of enzymatic deavage indude esterases which will deave ester 
bonds, nudeases which will deave phosphodlester bonds, proteases which deave peptide bonds, etc. 
[0353] Suitable devices which may be used to perform photodeavage of a tagged molecules include the device 

45 known by the acronym "PHRED", which stands for Photochemical Reactor for Enhanced Detection, and is available 
from Aura Industries, Staten Island, NY (available with boih a 254 nm and 366 nm bulb), and PhotoBlaster System - 
1 with LuxTube assembly, available from Agrenetics. 81 Salem Street Wilmington. MA USA 01887 (available with a 
366 nm bulb, but the tubing contains a photocatalyst that is activated by 366 nm light, resulting in emmision of a range 
of wavelengths induding 254nm). 

50 

E. DETECTION OF TAGS 



[0354] Detection methods typically rely on the absorption and emission in some type of spectral field. When atoms 
or molecules absorb light, the incoming energy excites a quantized structure to a higher energy level. The type of 
excitation depends on the wavelength of the light. Electrons are promoted to higher orbitals by ultraviolet or visible 
light, molecular vibrations are excited by infrared light, and rotations are excited by microwaves. An absorption spectnjm 
is the absorption of light as a function of wavelength. The spectrum of an atom or molecule depends on its energy level 
structure. Absorption spectra are useful for identification of compounds. Specific absorption spectroscopic methods 
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include atomic absorption spectroscopy (AA), infrared spectroscopy (IR), and UV-vis spectroscopy (uv-vis). 
[0355] Atoms or molecules ttiat are excited to high energy levels can decay to lower levels by emitting radiation. 
This light emission is called fluorescence if the transition is between states of the same spin, and phosphorescence if 
the transition occurs between states of different spin. The emission intensity of an analyte is linearly proportional to 
5 concentration (at low concentrations), and is useful for quantifying the emitting species. Specific emission spectroscopic 
methods include atomic emission spectroscopy (AES), atomic fluorescence spectroscopy (AFS). molecular laser-in- 
duced fluorescence (LIF), and X-ray fluorescence (XRF). 

[0356] When electromagnetic radiation passes through matter, most of the radiation continues in its original direction 
but a small fraction is scattered in other directions. Light that is scattered at the same wavelength as the incoming tight 
10 Is called Rayleigh scattering. Light that is scattered in transparent solids due to vibrations (phonons) is called Brillouin 
scattering. Brillouin scattering is typically shifted by 0.1 to 1 wave number from the Incident light Light that is scattered 
due to vibrations in molecules or optical phonons in opaque solids is called Raman scattering. Raman scattered light 
is shifted by as much as 4000 wavenumbers from the incident light Specific scattering spectroscopic methods include 
Raman spectroscopy. 

15 [0357] IR spectroscopy is the measurement of the wavelength and intensity of the absorption of mid-infrared light 
by a sample. Mid-infrared light (2.5 - '50 ^m, 4000 - 200 cm*^) is energetic enough to excite molecular vibrations to 
higher energy levels. The wavelength of IR absorption bands are characteristic of specific types of chemical bonds 
and IR spectroscopy is generally most useful for identification of organic and organometallic molecules. 
[0358] Near-infrared absorption spectroscopy (NIR) is the measurement of the wavelength and intensity of the ab- 

20 sorption of near-infrared light by a sample. Near-infrared light spans the 800 nm - 2.5 fim (12.500 - 4000 cm-^) range 
and Is energetic enough to excite overtones and combinations of molecular vibrations to higher energy levels. NIR 
spectroscopy is typically used for quantitative measurement of organic functional groups, especially 0-H, N-H, and 
C=0. The components and design of NIR instrumentation are similar to uv-vis absorption spectrometers. The light 
source is usually a tungsten lamp and the detector is usually a PbS solid-state detector. Sample holders can be glass 

25 or quartz and typical solvents are CCI4 and CS2. The convenient instrumentation of NIR spectroscopy makes it suitable 
for on-line monitoring and process control. 

[0359] Ultraviolet and Visible Absorption Spectroscopy (uv-vis) spectroscopy is the measurement of the wavelength 
and Intensity of absorption of near-ultraviolet and visible light by a sample. Absorption In the vacuum UV occurs at 
100-200 nm; (105-50,000 cm-^) quartz UV at 200-350 nm; (50.000-28,570 cm-i) and visible at 350-800 nm; 

30 (28.570-12,500 cm"^) and is described by the Beer-Lambert-Bouguet law. Ultraviolet and visible light are energetic 
enough to promote outer electrons to higher energy levels, UV-vis spectroscopy can be usually applied to molecules 
and inorganic Ions or complexes In solution. The uv-vis spectra are limited by the broad features of the spectra. The 
light source is usually a hydrogen or deuterium lamp for uv measurements and a tungsten lamp for visible measure- 
ments. The wavelengths of these continuous light sources are selected with a wavelength separator such as a prism 

35 or grating monochromator. Spectra are obtained by scanning the wavelength separator and quantitative measurements 
can be made from a spectrum or at a single wavelength. 

[0360] Mass spectrometers use the difference In the mass-to-charge ratio (m/z) of Ionized atoms or molecules to 
separate them from each other. Mass spectrometry is therefore useful for quantitation of atoms or molecules and also 
for determining chemical and structural information about molecules. Molecules have distinctive fragmentation patterns 

40 that provide structural information to identify compounds. The general operations of a mass spectrometer are as follows. 
Gas-phase ions are created, the ions are separated in space or time based on their mass-to-charge ratio, and the 
quantity of ions of each mass-to-charge ratio Is measured. The Ion separation power of a mass spectrometer is de- 
scribed by the resolution, which is defined as R = m / delta m, where m is the ion mass and delta m is the difference 
in mass between two resolvable peaks in a mass spectrum. For example, a mass spectrometer with a resolution of 

45 1000 can resolve an ion with a m/z of 100.0 from an ion with a m/z of 100.1 . 

[0361] In general, a mass spectrometer (MS) consists of an ion source, a mass-selective analyzer, and an ion de- 
tector. The magnetic-sector, quadrupole, and time-of-flight designs also require extraction and acceleration ion optics 
to transfer ions from the source region into the mass analyzer. The details of several mass analyzer designs (for 
magnetic-sector MS, quadrupole MS or time-of-flight MS) are discussed below. Single Focusing analyzers for magnetic- 

50 sector MS utilize a particle beam path of 180. 90, or 60 degrees. The various forces influencing the particle separate 
ions with different mass-to-charge ratios. With double-focusing analyzers, an electrostatic analyzer is added In this 
type of Instalment to separate particles with difference in kinetic energies. 

[0362] A quadrupole mass filter for quadrupole MS consists of four metal rods arranged In parallel. The applied 
voltages affect the trajectory of ions traveling down the flight path centered between the four rods. For given DC and 
55 AC voltages, only ions of a certain mass-to-charge ratio pass through the quadrupole filter and all other ions are throvim 
out of their original path. A mass spectrum Is obtained by monitoring the ions passing through the quadrupole filter as 
the voltages on the rods are varied. 

[0363] A time-of-flight mass spectrometer uses the differences in transit time through a "drift region" to separate ions 
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of different masses. It operates in a pulsed mode so ions must be produced in pulses and/or extracted In pulses. A 
pulsed electric field accelerates all ions into a field-free drift region with a kinetic energy of qV. where q is the ion charge 
and V is the applied voltage. Since the ion kinetic energy is 0.5 mV^, lighter ions have a higher velocity than heavier 
ions and reach the detector at the end of the drift region sooner. The output of an ion detector Is displayed on an 

5 oscilloscope as a function of time to produce the mass spectrum. 

[0364] The ion formation process is the starting point for mass spectrometric analyses. Chemical ionization Is a 
method that employs a7eagent ion to react with the arialyte'mbleculies (tags) to form ions by either a proton or hydride 
transfer. The reagent ions are produced by introducing a large excess of methane (relative to the tag) into an electron 
impact (El) ion source. Electron collisions produce CH4' and CH3* which further react with methane to form CHs" and 

10 C2H5*. Another method to ionize tags is by plasma and glow discharge. Plasma is a hot. partially-ionized gas that 
effectively excites and Ionizes atoms. A glow discharge is a low-pressure plasma maintained between two electrodes, 
□ectron impact Ionization employs an electron beam, usually generated from a tungsten filament, to ionize gas-phase 
atoms or molecules. An electron from the beam knocks an electron off analyte atoms or molecules to create ions. 
Electrospray ionization utilizes a very fine needle and a series of skimmers. A sample solution is sprayed into the 

15 source chamber to form droplets. The droplets carry charge when the exit the capillary and as the solvent vaporizes 
the droplets disappear leaving highly charged analyte molecules. ESI is particulariy useful for large biological molecules 
that are difficult to vaporize or ionize. Fast-atom bombardment (FAB) utilizes a high-energy beam of neutral atoms, 
typically Xe or Ar, that strikes a solid sample causing desorption and ionization. It is used for large biological molecules 
that are difficult to get into the gas phase. FAB causes little fragmentation and usually gives a large molecular ion peak, 

20 making it useful for molecular weight determination. The atomic beam is produced by accelerating ions from an ion 
source though a charge-exchange cell. The ions pick up an electron in collisions with neutral atoms to form a beam of 
high energy atoms. Laser ionization (LIMS) is a method in which a laser pulse ablates material from the surface of a 
sample and creates a microplasma that ionizes some of the sample constituents. Matrix-assisted laser desorption 
ionization (MALDl) is a LIMS method of vaporizing and ionizing large biological molecules such as proteins or DNA 

25 fragments. The biological molecules are dispersed in a solid matrix such as nicotinic acid. A UV laser pulse ablates 
the matrix which carries some of the large molecules into the gas phase in an ionized forni so they can be extracted 
into a mass spectrometer. Plasma-desorption ionization (PD) utilizes the decay of 252cf which produces two fission 
fragments that travel in opposite directions. One fragment strikes the sample knocking out 1 -1 0 analyte ions. The other 
fragment strikes a detector and triggers the start of data acquisition. This ionization method is especially useful for 

30 large biological molecules. Resonance ionization (RIMS) is a method in which one or more laser beams are tuned in 
resonance to transitions of a gas-phase atom or molecule to promote it in a stepwise fashion above its ionization 
potential to create an ion. Secondary ionization (SIMS) utilizes an ion beam; such as^He*. ^^O*. or^^Ar*^; is focused 
onto the surface of a sample and sputters material into the gas phase. Spark source is a method which Ionizes analytes 
in solid samples by pulsing an electric cun-ent across two electrodes. 

35 [0365] A tag may become charged prior to, during or after cleavage from the molecule to which it is attached. Ioni- 
zation methods based on ion "desorption", the direct formation or emission of ions from solid or liquid surfaces have 
allowed increasing application to nonvolatile and thermally labile compounds. These methods eliminate the need for 
neutral molecule volatilization prior to ionization and generally minimize thermal degradation of the molecular species. 
These methods include field desorption (Becky, Principles of Field Ionization and Field Desorption Mass Spectrometry, 

40 Pergamon, Oxford, 1977), plasma desorption (Sundqvist and Macfariane, Mass Spectrom. Rev, 4:421, 1985), laser 
desorption (Karas and Hillenkamp, A)a/. Chem. 60:2299, 1988; KarasetaL. A/7gew. Chem, f 0t:805, 1989). fast particle 
bombardment (e.g., fast atom bombardment, FAB, and secondary ton mass spectrometry, SIMS, Barber et aL. Anal. 
Chem. 54:645A, 1982), and themnospray (TS) ionization (Vestal, Mass Spectrom. Rev. 2:447, 1983), Thermospray is 
broadly applied for the on-line combination with liquid chromatography. The continuous flow FAB methods (Caprioli et 

45 a!., Anal. Chem. 58:2949. 1986) have also shown significant potential. A more complete listing of ionization/mass 
spectrometry combinations is ion-trap mass spectrometry, electrospray ionization mass spectrometry, ion-spray mass 
spectrometry, liquid ionization mass spectrometry, atmospheric pressure ionization mass spectrometry, electron ioni- 
zation mass spectrometry, metastable atom bombardment ionization mass Spectrometry, fast atom bombard ionization 
mass spectrometry, MALDl mass spectrometry, photo-ionization time-of-flight mass spectrometry, laser droplet mass 

50 spectrometry, MALDI-TOF mass spectrometry. APCI mass spectrometry, nano-spray mass spectrometry, nebulised 
spray ionization mass spectrometry, chemical ionization mass spectrometry, resonance ionization mass spectrometry, 
secondary ionization mass spectrometry, themiospray mass spectrometry, 

[0366] The ionization methods amenable to nonvolatile biological compounds have overiapping ranges of applica- 
bility. Ionization efficiencies are highly dependent on matrix composition and compound type. Cun-ently available results 
55 indicate that the upper molecular mass for TS Is about 8000 daltons (Jones and Krolik. Rapid Comm. Mass Spectrom. 
Y:67, 1987). Since TS is practiced mainly with quadrapole mass spectrometers, sensitivity typically suffers disporpor- 
tionately at higher mass-to-charge ratios (m/z). Time-of-flight (TOF) mass spectrometers are commercially available 
and possess the advantage that the nn/z range is limited only by detector efficiency. Recently, two additional ionization 
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methods have been introduced. These two methods are now referred to as matrix-assisted laser desorption (MALDI, 
Karas and Hillenkamp. Anal. Chem. 60:2299. 1988; Karas et al., Angew. Chem 101:805, 1989) and electrospray ion- 
ization (ESI). Both methodologies have very high ionization efficiency {i.e., very high [molecular ions produced]/[mol- 
ecules consumed]). Sensitivity, which defines the ultimate potential of the technique, is dependent on sample size, 

5 quantity of ions, flow rate, detection efficiency and actual ionization efficiency. 
- - . _[0367] Electrpspray-MS is based on _an idea first proposed in the 1 960s (Dole et al., J. Chem. Phys. 49:2240, 1 968). 
Electrospray ionization (ESI) is one means to produce charged molecules foF analysis by mass spectroscopy. Briefly, 
electrospray ionization produces highly charged droplets by nebulizing liquids in a strong electrostatic field. The highly 
charged droplets, generally fomied in a dry bath gas at atmospheric pressure, shrink by evaporation of neutral solvent 

to until the charge repulsion overcomes the cohesive forces, leading to a "Couiombic explosion". The exact mechanism 
of Ionization is controversial and several groups have put forth hypotheses (Blades et al., AnaL Chem. 63:2109-14. 
1 991 : Kebarie et al.. Anal. Chem. 65:A972-86. 1 993; Fenn, 1 Am. Soc. Mass. Spectmm. 4:524-35. 1 993). Regardless 
of the ultimate process of ion formation, ESI produces charged molecules from solution under mild conditions. 
[0368] The ability to obtain useful mass spectral data on small amounts of an organic molecule relies on the efficient 

15 production of ions. The efficiency of ionization for ESI is related to the extent of positive charge associated with the 
molecule. Improving ionization experimentally has usually involved using acidic conditions. Another method to improve 
ionization has been to use quaternary amines when possible (see Aebersold et al.. Protein Science y:494-503. 1992; 
Smith et al., AnaL Chem. 60:436-41. 1988). 

[0369] Electrospray ionization is described in more detail as follows. Electrospray ion production requires two steps: 
20 dispersal of highly charged droplets at near atmospheric pressure, followed by conditions to induce evaporation. A 
solution of analyte molecules is passed through a needle that is kept at high electric potential. At the end of the needle, 
the solution disperses into a mist of small highly charged droplets containing the analyte molecules. The small droplets 
evaporate quickly and by a process of field desorption or residual evaporation, protonated protein molecules are re- 
leased into the gas phase. An electrospray is generally produced by application of a high electric field to a small flow 
25 of liquid (generally 1 -1 0 uL/min) from a capillary tube. A potential difference of 3-6 kV is typically applied between the 
capillary and counter electrode located 0.2-2 cm away (where tons, charged clusters, arid even charged droplets, 
depending on the extent of desolvatkni, may be sampled by the MS through a small orifice). The electric field results 
in charge accumulation on the liquid surface at the capillary terminus; thus the liquid flow rate, resistivity, and surface 
tension are important factors in droplet production. The high electric field results in disruption of the liquid surface and 
30 formation of highly charged liquid droplets. Positively or negatively charged droplets can be produced depending upon 
the capillary bias. The negative Ion mode requires the presence of an electron scavenger such as oxygen to Inhibit 
electrical discharge. 

[0370] A wide range of liquids can be sprayed electrostatically into a vacuum, or with the aid of a nebulizing agent. 
The use of only electric fields for nebulization leads to some practical restrictions on the range of liquid conductivity 

35 and dielectric constant. Solution conductivity of less than 10-5 ohms is required at room temperature for a stable elec- 
trospray at useful liquid flow rates con-esponding to an aqueous electrolyte solution of < 10-^ M. In the mode found 
most useful for ESI-MS, an appropriate liquid flow rate results in dispersion of the liquid as a fine mist. A short distance 
from the capillary the droplet diameter is often quite uniform and on the order of 1 jim. Of particular importance is that 
the total electrospray ion current increases only slightly for higher liquid flow rates. There is evidence that heating is 

^ useful for manipulating the electrospray. For example, slight heating allows aqueous solutions to be readily electro- 
sprayed, presumably due to the decreased viscosity and surface tension. Both thermally-assisted and gas-nebulization- 
assisted electrosprays allow higher liquid flow rates to be used, but decrease the extent of droplet charging. The for- 
mation of molecular ions requires conditions effecting evaporation of the initial droplet population. This can be accom- 
plished at higher pressures by a flow of dry gas at moderate temperatures (<60°C), by heating during transport through 

^5 the interface, and (particularly in the case of ion trapping methods) by energetic collisions at relatively low pressure. 
[0371] Although the detailed processes underiying ESI remain uncertain, the very small droplets produced by ESI 
appear to allow almost any species carrying a net charge in solution to be transferred to the gas phase after evaporation 
of residual solvent. Mass spectrometric detection then requires that ions have a tractable m/z range (<4000 daltons 
for quadrupole instruments) after desolvatlon, as well as to be produced and transmitted with sufficient efficiency. The 

50 wide range of solutes already found to be amenable to EST-MS, and the lack of substantial dependence of ionization 
efficiency upon molecular weight, suggest a highly non-discriminating and broadly applicable ionization process. 
[0372] The electrospray ion "source" functions at near atmospheric pressure. The electrospray "source" is typically 
a metal or glass capillary incorporating a method for electrically biasing the liquid solution relative to a counter electrode. 
Solutions, typically water-methanol mixtures containing the analyte and often other additives such as acetic acid, flow 

55 to the capillary terminus. An ESI source has been described (Smith et al., Anat. Chem. 62:885, 1990) which can ac- 
commodate essentially any solvent system. Typical flow rates for ESI are 1-10 uUmin. The principal requirement of 
an ESI-MS interface is to sample and transport ions from the high pressure region into the MS as efficiently as possible. 
[0373] The efficiency of ESI can be very high, providing the basis for extremely sensitive measurements, which is 
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useful for the invention described herein. Current instnjmental performance can provide a total ion current at the de- 
tector of about 2 X 10-^2 a or about 10^ counts/s for singly charged species. On the basis of the instrumental perform- 
ance, concentrations of as low as 10-^° M or about 10*^^ mol/s of a singly charged species will give detectable ion 
current (about 10 counts/s) if the analyte is completely ionized. For example, low attomole detection limits have been 

5 obtained for quaternary ammonium ions using an ESi interface with capillary zone electrophoresis (Smith et al.. Anal. 
Chem, 59:1230, 1J98_8)._For a cgnipqund of mo[ecutar weight of 1000, the average number of charges is 1 . the approx- 
imate number of charge states is 1 , peak width (m/z) is 1 and the maximum intensity (ion/s) isl x 10^^^ 
[0374] Remarkably little sample is actually consumed in obtaining an ESI mass spectrum (Smith et al., AnaL Chem. 
60:1 948, 1 988). Substantial gains might be also obtained by the use of array detectors with sector instruments, allowing 

10 simultaneous detection of portions of the spectrum. Since currently only about 10"^ of all ions formed by ESI are de- 
tected, attention to the factors limiting instrument performance may provide a basis for improved sensitivity. It will be 
evident to those in the art that the present invention contemplates and accommodates for improvements in ionization 
and detection methodologies. 

[0375] An interface is preferably placed between the separation instrumentation (e.g., gel)and the detector (e.g., 
15 mass spectrometer). The interface preferably has the following properties: (1 ) the ability to collect the DN A fragments 
at discreet time intervals, (2) concentrate the DNA fragments. (3) remove the DNA fragments from the electrophoresis 
buffers and milieu, (4) cleave the tag from the DNA fragment, (5) separate the tag from the D^4A fragment, (6) dispose 
of the DNA fragment, (7) place the tag in a volatile solution, (8) volatilize and ionize the tag, and (9) place or transport 
the tag to an electrospray device that introduces the tag into mass spectrometer, 
20 [0376] The interface also has the capability of "collecting" DNA fragments as they elute from the bottom of a gel. 
The gel may be composed of a slab gel, a tubular gel, a capillary, etc. The DNA fragments can be collected by several 
methods. The first method is that of use of an electric field wherein DNA fragments are collected onto or near an 
electrode. A second method is that wherein the DNA fragments are collected by flowing a stream of liquid past the 
bottom of a gel. Aspects of both methods can be combined wherein DNA collected into a flowing streain which can be 
25 later concentrated by use of an electric field. The end result is that DNA fragments are removed from the milieu under 
which the separation method was performed. That is, DNA fragments can be "dragged** from one solution type to 
another by use of an electric field. 

[0377] Once the DNA fragments are in the appropriate solution (compatible with electrospray and mass spectrometry) 
the tag can be cleaved from the DNA fragment. The DNA fragment (or remnants thereof) can then be separated from 

30 the tag by the application of an electric field (preferably, the tag is of opposite charge of that of the DNA tag). The tag 
is then introduced into the electrospray device by the use of an electric field or a flowing liquid. 
[0378] The detection device is preferably a mass spectrometer. Because mass spectrometers use the difference in 
the mass-to-charge ratio (m/e) of ionized species to specifically identity molecules, this detection technique is useful 
for quantitation of small molecules and also for determining chemical and structural information about molecules. Some 

35 molecules have distinctive fragmentation pattems that can provide information to identify structural components. How- 
ever, for the use described herein, the MSD is essentially used as a "array-detector* for the detection, measurement 
and quantitation of tags of known molecular weight. Thus, a mass spectrometer may be employed much as scientists 
currently use a diode-array detector in a UV/VIS spectrometer to measure small molecules with known extinction 
coefficients. In the application described herein, the tags are used to identify the presence or absence of specific nucleic 

40 acid sequence and map sample identification, 

[0379] A quadrupole mass detector consists of four parallel metal rods. Two opposite rods have an applied potential 
of (U't'Vcos(wt)) and the other two rods have a potential of -(U Wcos(wt)), where U is a dc voltage and Vcos(wt) is an 
ac voltage. The applied voltages affect the trajectory of Ions traveling down the flight path centered between the four 
rods. For given dc and ac voltages, only Ions of a certain mass-to-charge ratio pass through the quadrupole filter and 

45 all other ions are thrown out of their original path. A mass spectrum is obtained by monitoring the ions passing through 
the. quad mpole filter as the voltages on the rods are varied. The ion separation power of a mass spectrometer is 
described by the resolution which is the difference in mass between two resolvable peaks in a mass spectrum. That 
is. a mass spectrometer with a resolution of 1000 can resolve an ion with a m/e of 100.0 from an ion with an m/e of 
100.1. The general operation of a mass spectrometer is to first create gas-phase ions, second, separate the ions in 

50 space or time based on their mass-to-charge ratio, and third, measure the quantity of ions of each mass-to-charge ratio. 
[0380] The mass spectrometer is ideally suited as a spectrometer for applications in genomics as it permits the 
simultaneous measurement of hundreds of tags. The cun*ent number of tags used in genomics applications (sequenc- 
ing, mapping, genotyping) is about 4, which results from the overiapping emission spectrum of fluorescent tags which 
can be placed between 300 nm and 700 nm. In contrast, with cun-ent quadrapole instruments (such as the Micromass 

55 MS, the Hewlett Packard LC/MSD 1 1 00, the PE Sciex AP1 1 65 LC/MS, or the Finnigan Navigator) about 400 tags can 
be placed in the spectra of 50-3000 amu. The MS insruments have at least 0.1 amu resolution. The new measurement 
system we describe (i.e., tagged biomolecules) can be used in conjunction with almost all commercial mass spectrom- 
eters and HPLC systems with little modification. Ideally, a software package will be apended to the "driver* software 
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to elaborate the molecular biological, genetic or genomic applications. Suitable softward packages are described in, 
e.g., U.S. Provisional Patent Application No. 60/053.429, filed July 22, 1997. 

[0381] Atmospheric pressure chemical ionization (APCI) can applied to a variety of tag types and Is generally used 
to enhance the abundance of the molecular ion (molecules rarely form adducts during APCI), The usable molecular 
5 weight range of APCI is 50 to 3000 amu (in general). Accuracy of the mass measurement at low resolving power is 
- - - -0.1 amu and in the high resolution mode, 5 ppm^ APCI uses a reagent ion to r 

ions by either a proton or hydride transfer. Currently, only about 10"* to 10"^ of all ions formed by APCI are detected.' 
This important parameter (which limits instrument performance) may provide a basis for improved sensitivity. This 
ionization technique is a continuous method that is suitable for using as an interface with HPLC or capillary electro- 
10 phoresis. 

[0382] There are alternative forms of ionization that can be employed for the CMST technology. Electrospray ioni- 
zation (ESI) allows production of molecular ions directly from samples in solution. Electrospray ionization is one method 
that is compatible with single quadrapole instruments. Very little sample is consumed in obtaining an ESI mass spectrum 
(Smith et al.. Anal. Chem, 60:1948, 1988) but the overall efficiency of ion introduction into the MS vacuum system 
15 remains relatively inefficient. ESt also suffers from the effects of denaturants and detergents which adversely effect 
the ionization step. It can also be used for small and large molecular-weight biopolymers (peptides, proteins, carbo- 
hydrates, and DNA fragments), and lipids. Unlike MALDl, which is pulsed, ESI is a continuous ionization method. With 
ESI as the ionization method, multiply charged ions are usually produced (molecules are more prone to adduct forma- 
tion). 

20 [0383] An alternative ionization method is matrix-assisted laser desorption (MALDl) which can be used to determine 
the molecular weight of tags described here (peptides, proteins, oligonucleotides, and other compounds of biological 
origin as well as of small synthetic polymers can also be measured). The amount of sample needed is very low (pmoles 
or less). The analysis can be performed in the linear mode (high mass, low resolution) up to a molecular weight of m/ 
z 300,000 (in rare cases) or reflectron mode (lower mass, higher resolution) up to a moliecular weight of 10,000. 

25 [0384] APCI and ESI should in general be considered a complement to MALDl. MALDl has the severe disadvange 
of being a non-quantifable ionization method. Electrospray ionization has been installed on HP quadapole. the Finnigan 
LCQ, the Finnigan TSQ 7000 (Medicine), the PE/Sciex instrument, and the Micromass instrument. Both APCI and ESI 
have the advantage over MALDl of being capable of making real time measurements of tags during any type of sep- 
aration methodology (i.e., HPLC and electrophoresis). 

30 [0385] Fluorescent tags can be identified and quantitated most directly by their absorption and fluorescence emission 
wavelengths and intensities. 

[0386] While a conventional spectrofluorometer is extremely flexible, providing continuous ranges of excitation and 
emission wavelengths (l^x, Isi, Iga)* ^^e specialized instruments such as flow cytometers and laser-scanning micro- 
scopes require prot>es that are excitable at a single fixed wavelength. In contemporary instruments, this is usually the 

35 488-nm line of the argon laser. 

[0387] Fluorescence Intensity per probe molecule is proportional to the product of e and QY. The range of these 
parameters among fluorophores of current practical importance is approximately 10.000 to 100,000 cnr^M^^ for e and 
0.1 to 1.0 forQY. When absorption is driven toward saturation by high-intensity illumination, the irreversible destruction 
of the excited fluorophore (photobleaching) becomes the factor limiting fluorescence detectability. The practical impact 

<o of photobleaching depends on the fluorescent detection technique in question. 

[0388] It will be evident to one in the art that a device (an interface) may be interposed between the separation and 
detection steps to permit the continuous operation of size separation and tag detection (in real time). This unites the 
separation methodology and instrumentation with the detection methodology and instrumentation forming a single 
device. For example, an interface is interposed between a separation technique and detection by mass spectrometry 

45 or potentiostatic amperometry. 

[0389] The function of the interface is primarily the release of the (e.g., mass spectrometry) tag from analyte. There 
are several representative implementations of the interface. The design of the interface is dependent on the choice of 
deavable linkers. In the case of light or photo-cleavable linkers, an energy or photon source is required. In the case 
of an acid-labile linker, a base-labile linker, or a disulfide linker, reagent addition is required within the interface. In the 

50 case of heat-labile linkers, an energy heat source is required. Enzyme addition is required for an enzyme-sensitive 
linker such as a specific protease and a peptide linker, a nuclease and a DNA or RNA linker, a glycosylase. HRP or 
phosphatase and a linker which is unstable after cleavage (e.g., similar to chemiluminescent substrates). Other char- 
acteristics of the interface include minimal band broadening, separation of DNA firom tags before Injection into a mass 
spectrometer. Separation techniques include those based on electrophoretic methods and techniques, afflnity tech- 

55 niques, size retention (dialysis), filtration and the tike. 

[0390] It is also possible to concentrate the tags (or nucleic acid-linker-tag construct), capture electrophoretically, 
and then release into alternate reagent stream which is compatible with the particular type of bntzation method selected. 
The interface may also be capable of capturing the tags (or nucleic add-linker-tag construct) on microbeads, shooting 
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the bead(s) into chamber and then prefonning laser desorption/vaporization. Also it is possible to extract in flow into 
alternate buffer (e.g., from capillary electrophoresis buffer into hydrophobic buffer across a permeable membrane). It 
may also be desirable in some uses to deliver tags into the mass spectrometer intermittently which would comprise a 
further function of the interface. Another function of the interface is to deliver tags from multiple columns into a mass 
5 spectrometer, with a rotating time slot for each column. Also, it is possible to deliver tags from a single column into 
multiple MS detectors, separated by time, coilect^each set of tag^ for a few milliseconds, and then deliver to a mass 
spectrometer. 

[0391] The following is a list of representative vendors for separation and detection technologies which may be used 
in the present invention. Hoefer Scientific Instruments (San Francisco, CA) manufactures electrophoresis equipment 

10 (Two Step™, Poker Face™ II) for sequencing applications. Phanmacia Biotech (Piscataway, NJ) manufactures elec- 
trophoresis equipment for DNA separations and sequencing (PhastSystem for PCR-SSCP analysis. MacroPhor Sys- 
tem for DNA sequencing). Perkin Elmer/Applied Biosystems Division (ABI, Foster City, CA) manufactures semi-auto- 
mated sequencers based on fluorescent-dyes (ABI373 and ABI377). Analytical Spectral Devices (Boulder, CO) man- 
ufactures UV spectrometers. Hitachi Instruments (Tokyo, Japan) manufactures Atomic Absorption spectrometers, Flu- 

15 orescence spectrometers, LC and GC Mass Spectrometers, NMR spectrometers, and UV-VIS Spectrometers. PerSep- 
tive Biosystems (Framingham, MA) produces Mass Spectrometers (Voyager™ Elite). Bruker Instruments Inc. (Manning 
Park, MA) manufactures FTIR Spectrometers (Vector 22). FT-Raman Spectrometers, Time of Flight Mass Spectrom- 
eters (Reflex II™), Ion Trap Mass Spectrometer (Esquire™) and a Maldi Mass Spectrometer. Analytical Technology 
Inc. (ATI, Boston, MA) makes Capillary Gel Electrophoresis units, UV detectors, and Diode Array Detectors. Teledyne 

20 Electronic Technologies (Mountain View, CA) manufactures an Ion Trap Mass Spectrometer (3DQ Discovery™ and 
the 3DQ Apogee™). Perkin Elmer/Applied Biosystems Division (Foster City. CA) manufactures a Sciex Mass Spec- 
trometer (triple quadrupole LC/MS/MS, the AP1 100/300) which is compatible with electrospray. Hewlett-Packard (Santa 
Clara, CA) produces Mass Selective Detectors (HP 5972A), MALDI-TOF Mass Spectrometers (HP G2025A), Diode 
An-ay Detectors, CE units, HPLC units (HP1090) as well as UV Spectrometers. Finnigan Corporation (San Jose, CA) 

25 manufactures mass spectrometers (magnetic sector (MAT 95 S™), quadrapole spectrometers (MAT 95 SQ™) and 
four other related mass spectrometers). Rainin (Emeryville. CA) manufactures HPLC instruments. 
[0392] The methods and compositions descrit>ed herein permit the use of cleaved tags to serve as maps to particular 
sample type and nucleotide identity. At the beginning of each sequencing method, a particular (selected) primer is 
assigned a particular unique tag. The tags map to either a sample type, a dideoxy terminator type (in the case of a 

30 Sanger sequencing reaction) or preferably both. Specifically, the tag maps to a primer type which in turn maps to a 
vector type, which in turn maps to a sample identity. The tag may also may map to a dideoxy terminator type (ddTTP, 
ddCTP, ddGTP, ddATP) by reference into which dideoxynucleotide reaction the tagged primer is placed. The sequenc- 
ing reaction is then performed and the resulting fragments are sequentially separated by size in time. 
[0393] The tags are cleaved from the fragments in a temporal frame and measured and recorded in a temporal frame. 

35 The sequence is constructed by comparing the tag map to the temporal frame. That is, all tag identities are recorded 
in time after the sizing step and related become related to one another in a temporal frame. The sizing step separates 
the nucleic acid fragments by a one nucleotkle increment and hence the related tag identities are separated by a one 
nucleotide increment. By foreknowledge of the dideoxy^tenninator or nucleotide map and sample type, the sequence 
is readily deduced in a linear fashion. 

40 [0394] A genetic fingerprinting system of the present invention consists of, in general, a sample introduction device, 
a device to separate the tagged samples of interest, a splitting device to deviate a variable amount of the sample to a 
fraction collector, a device to cleave the tags from the samples of interest, a device for detecting the tag, and a software 
program to analyze the data collected and display it In a differential display mode. It will be evident to one of ordinary 
skill in the art when in possession of the present disclosure that this general description may have many variances for 

<5 each of the components listed. As best seen in Figure 1 5, an exemplary genetic fingerprinting system 1 0 of the present 
invention consists of a sample introduction device 12, a separation device 14 that separates the samples by high- 
performance liquid chromatography (HPLC), a splitting device 13, a fraction collector 15, a photocleavage device 16 
to cleave the tags from the samples of Interest, a detection device 18 that detects the tags by means of an electro- 
chemical detector, and a data processing device 20 with a data analysis software program that analyzes the results 

50 from the detection device. Each component is discussed in more detail below. 

[0395] The sample introduction device 1 2 automatically takes a measured aliquot 22 of the PCR products generated 
in the genetic fingerprinting procedure and delivers it through a conventional tube 24 to the separation device 14 
(generally an HPLC). The sample introduction device 12 of the exemplary embodiment consists of a temperature- 
controlled autosampler 26 that can accommodate micro-titer plates. The autosampler 26 is temperature controlled to 

55 maintain the integrity of the nucleic acid samples generated and is able to inject 25 fil or less of sample. Manufacturers 
of this type of sample introduction device 12 are. for example, Gilson (Middleton, Wl). 

[0396] The sample introduction device is operatively connected in series to the separation device 1 4 by the conven- 
tional tube 24. The PCR products in the measured aliquot 22 received in the separation device 14 are separated 
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temporally by high performance liquid chromatography to provide separated DNA fragments. The high-performance 
liquid chromatograph may have an isocratic, binary, or quaternary pump(s) 27 and can be purchased from multiple 
manufacturers (e.g., Hewlett Packard (Palo Alto, CA) HP 1100 or 1090 series, Beckman Instruments Inc. 
(800-742-2345). Bioanalytical Systems. Inc. (800-845-4246), ESA, Inc. (508) 250-700). Perkin-Elmer Corp. 

5 (800-762-4000). Varian Instruments (800-926-3000). Waters Corp, (800-254-4752)). 

[0397] _ The.sepajation device 14 iridudes an analytical HPLC column 28 suitable for use to separate the oligonucle- 
otides. The column 28 is an analytical HPLC, for example, non-porous polysTyrene divihylbenzerie (2.2 nm particle" 
size) solid support modified which can operate within a pH range of 2 to 12, pressures of up to 3000 psi and a tem- 
perature range of 10 to 70**C. A temperature-control device (e.g. a column oven) (not shown) may be used to control 

10 the temperature of the column. Such temperature-control devices are known in the art and may be obtained from, for 
example, Rainin Instruments (subsidiary of Varian Instrument. Palo Alto, CA). A suitable column 28 is available under 
the commercial name of DNAsep® and is available from Serasep (San Jose, CA). Other suitable analytical HPLC 
columns are available from other manufacturers (e.g., Hewlett Packard (Palo Alto, CA), Beckman Industries (Brea, 
CA), Waters Corp, (Milford, MA), and SupeIco (Bellefonte. PA). 

15 [0398] The separation device 14 in the illusfrated embodiment incorporates 'the sample splitter 1 3, and the sample 
splitter Is connected to the flowing sfream of the sample. The sample splitter 13 is adapted to divert in a conventional 
manner variable amounts of sample to the fraction collector 1 5 either for further analysis or storage. The fraction 
collector 1 5 must be able to accommodate small volumes, have temperature control to low temperatures, and have 
adjustable sampling times. Manufacturers of in-line splitters include Upchurch (Oak Harbor, WA). 

20 [0399] The fraction collector 15 is attached to the HPLC/LC device via a sfream-splitter line 29. Fraction collectors 
1 5 permit the collection of specific peaks, DMAs. RNAs. and nucleic acid fragments or molecules of interest into tubes, 
wells of microtiter plates, or containers. Additionally, fraction collectors 1 5 can collect all or part of a set of nucleic acid 
fragments separated by HPLC or LC. Manufacturers of fraction collectors include Gilson (Middieton, Wl), and Isco 
(Lincoln, NE). The use of a fraction collector 15 in this technology provides considerable substantial advantages over 

25 gel based systems. For example, it is possible to directly done nudeic acids fragments recovered by HPLC or LC 
methods. In addition, it is possible to amplify nudeic acids fragments recovered by HPLC or LC methods by PCR. 
These two methods permit the rapkj identification of nudeic acid fragments of interest on a sequence level. Both 
methods are tedious and ineffective when used in conjundion with gel-based systems. 

[0400] In the illusfrated embodiment, the fraction collector 1 5 is an individual component of the genetic fingerprinting 
30. system 10. In an alternate embodiment (not shown), the fraction collector 1 5 is incorporated in the sample infroduction 
device 12. Accordingly, the sfream-splitter line 32 directs the diverted sample from the sample splitter 13 back to the 
sample infroduction device 12. 

[0401] A sfream of the separated DNA fragments (e.g., sequencing reaction products) flows through a conventional 
tube 30 from the separation device 14 downsfream of the sample splitter 13 to the deavage device 16. Each of the 
35 DNA fragments is labeled with a unique deavable (e.g., photodeavable) tag. The flowing stream of separated DNA 
fragments pass through or past the deaving device 16, where the tag is removed for detection (e.g., by mass spec- 
trometry or with an elecfrochemical detector). In the exemplary embodiment, the deaving device 16 is a photocleaving 
unit such that flowing sfream of sample is exposed to selected light energy and wave length. In one embodiment, the 
sample enters the photodeaving unit 16 and is exposed to the selected light source for a selected duration of time. In 
40 an altemate embodiment, the flowing stream of sample is carried adjacent to the light source along a path that provides 
a sufficient exposure to the light energy to deave the tags from the separated DNA fragments. 
[0402] A photocleaving unit is available from Supeico (Bellefonte. PA). Photocleaving can be perfomied at multiple 
wavelengths with a mercury/xenon are lamp. The wavelength accuracy Is about 2 nm with a bandwidth of 10 nm. The 
area irradiated Is circular and typically of an area of 10-100 square centimeters. In altemate embodiments, other cleav- 
es ing devices, which cleave by acid, base, oxidation, reduction, fluoride, thiol exchange, photolysis, or enzymatic condi- 
tions, can be used to remove the tags from the separated DNA fragments. 

[0403] After the deaving device 16 deaves the tags from the separated DNA fragments, the tags flow through a 
conventional tube 32 to the detection device 18 for detection of each tag. Detection of the tags can be based upon the 
difference in electrochemical potential between each of the tags used to label each kind of DNA generated in the PCR 

50 step. The electrochemical detedor 18 can operate on either coulometric or amperometric principles. The prefenred 
elecfromechanical detector 18 is the coulometric detector, which consists of a flow-through or porous-cartwn graphite 
amperomefric detector where the column eland passes through the electrode resulting in 100% detection efficiency. 
To fully detect each component, an anray of 16 coulometric detectors each held at a different potential (generally at 60 
mV increments) is utilized. Examples of manufacturers of this type of detector are ESA (Bedford, MA) and Bioanalytical 

55 Systems Inc. (800-845-4246). 

[0404] In an altemate embodiment illustrated schematically in Figure 16, the sample introduction device 12, the 
separating device 14, and the deavage device 16 are serially connected as discussed above for maintaining the flow 
of sample. The deavage device 16 is connected to a detection device 18 which is a mass specfrometer 40 or the like 
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that detects the tags based upon the difference in molecular weight between each of the lags used to label each kind 
of DNA generated in the PGR step. The best detector based upon differences in mass is the mass spectrometer For 
this use, the mass spectrometer 40 will typically have an atmospheric pressure ionization (API) interface with either 
electrospray or chemical ionization, a quadrupote mass analyzer, and a mass range of at least 50 to 2600 m/z. Examples 
5 of manufacturers of a suitable mass spectrometer are: Hewlett Packard (Palo Alto, CA) HP 1100 LC/MSD, Hitachi 
- Instruments (San Jose..CA)._M-1200H LC/MS, Perkin Elmer Corporation, Applied Biosystems Division (Foster City, 
CA) AP1 100 LC/MS or API 300 LC/MS/MS. Finnigan Corporation (San Jose, CA) LCQ. MAT 95~S7Bruker Analytical 
Systems, Inc. (Billerica, MA) APEX, BioAPEX, and ESQUIRE and Micromass (U.K.). 

[0405] The detection device 1 8 is electrically connected to a data processor and analyzer 20 that receives data from 
10 the detection device. The data processor and analyzer 20 includes a software program that identifies the detected tag. 
The data processor and analyzer 230 In alternate embodiments is operatively connected to the injection device 12. 
the separation device 14, the fraction collector 15, and/or the cleaving device 16 to control the different components 
of the genetic fingerprinting system 10. 

[0406] The software package maps the electrochemical signature of a given tag to a specific primer, and a retention 
15 time. Software generated nucleic acid profiles are then compared (length to length, fragment to fragment) and the 
results are reported to the user. The software will highlight both similarities and differences in the nucleic acid fragment 
profiles. The software will also be able to direct the collection of specific nudeic acid fragments by the fraction collector 
15. 

[0407] The software package maps the m/z signature of a given tag to a specific primer, and a retention time. Software 
20 generated nucleic acid profiles are then compared (length to length, fragment to fragment) and the results are reported 
to the user. The software will highlight both similarities and differences in the nucleic acid fragment profiles. The software 
will also be able to direct the collection of specific nucleic acid fragments by the fraction collector 
[0408] The system 18 in accordance with the present invention is provided by operatively interconnecting the sys- 
tem's multiple components. Accordingly, one or more system components, such as the sample introducing device 12 
25 and the detecting device 1 8 that are in operation in a lab can be combined with the system's other components, (e.g.. 
the separating device 14, cleaving device 16, and the data processor and analyzer 20 in order to equip the lab with 
the DNA sequendng system 10 of the present invention. 

[0409] /Another embodiment of the present invention provides a differential display system which consists of, in gen- 
eral, a sample introduction device, a device to separate the tagged samples of interest, a splitting device to deviate a 

30 variable amount of the sample to a fraction collector, a unit to cleave the tags from the samples of interest, a device 
for detecting the tag. and a software program to analyze the data collected and display it in a differential display mode. 
It will be evident to one of ordinary skill in possession of the present disclosure that the general description may have 
many variances for each of the components listed. The differential display system of an exemplary embodiment of the 
present invention consists of similar components illustrated in Figure 15. including the sample introduction device 12, 

35 the separation device 1 4 for separating the samples by high-performance liquid chromatography (HPLC), the splitting 
device 13. the fraction collector 15, the photodeavage device 16 to cleave the tags from the samples of interest, the 
detection device 1 8 for detection of the tags by electrochemistry, and the data processor and analyzer 20 with a software 
program. Each component is discussed in more detail below. 

[0410] In the differential display system, the sample introduction device 12 automatically takes a measured aliquot 

40 22 of the PCR product generated in the differential display procedure and delivers it through a conventional tube 24 
to the separation device 14 (generally an HPLC). The sample Introductton device 12 of the exemplary embodiment 
consists of a temperature-controlled autosampler 26 that can accommodate micro-titer plates. The autosampier 26 
must be temperature controlled to maintain the integrity of the nucleic acid samples generated and be able to inject 
25 \i\ or less of sample. Manufacturers of this type of product are represented, for example, by Gilson (Middleton, Wl), 

45 [041 1] The sample introduction device is operatively connected in series to the separation device by the conventional 
tube 24. The PCR products in the measured aliquot 22 received in the separation device 14 are separated temporally 
by high perfomnance liquid chromatography to provide separated DNA fragments. The high-performance liquid chro- 
matograph may have an isocratic. binary, or quaternary pump(s) 27 and can be purchased from multiple manufacturers 
(e.g., Hewlett Packard (Palo /Mto, CA) HP 1100 or 1090 series. Analytical Technology Inc. (Madison, Wl). Perkin Elmer, 

50 Waters, etc.). The separation device 14 includes an analytical HPLC column 28 suitable for use to separate the oligo- 
nucleotides. The column 28 is an analytical HPLC, for example, non-porous polystyrene divlnylbenzene (2.2 [ur\ particle 
size) solid support modified which can operate within a pH range of 2 to 12, pressures of up to 3000 psi and a tem- 
perature range of 10 to 70^C. A temperature-control device (e.g.. a column oven) (not shown) may be used to control 
the temperature of the column. Such temperature-control devices are known in the art, and may be obtained from, for 

55 example, Rainin Instruments (subsidiary of Varian Instrument, Palo Alto, CA). A suitable column 28 is available under 
the commercial name of DNAsep® and is available from Serasep (San Jose, CA), Other suitable analytical HPLC 
columns are available from other manufacturers (e.g., Hewlett Packard (Palo Alto. CA) (Beckman Industries (Brea. 
CA). Waters Corp. (Milford, MA), and Supeico (Bellefonte, PA). 
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[0412] In the Illustrated embodiment, the fraction collector 15 Is an individual component of the differential display 
system 10 that is coupled to the system's other components. In an alternate embodiment, the fraction collector 15 is 
incorporated in the sample introduction device 12. Accordingly, the stream-splitter line 32 directs the diverted sample 
from the sample splitter 13 back to the sample introduction device 12. 

5 [0413] The separation device 14 in the illustrated embodiment incorporates the sample splitter 1 3 that is connected 
to the.flowing.stream_of ttie sampje. The sample splitter J 3 is adapted to divert in a conventional manner variable 
amounts of sample to the fraction collector 15 either for further analysis or storage. The fraction collector 15 must' be 
able to accommodate small volumes, have temperature control to low temperatures, and have adjustable sampling 
times. Manufacturers of in-line splitters include Upchurch (Oak Harbor, WA), 

10 [0414] A stream of the separated DNA fragments flow through a conventional tube 30 from the separation device 
1 4 downstream of the sample splitter 1 1 3 to the cleavage device 1 6. Each of the PGR products is labeled with a unique 
deavable (e.g.. photodeavable) tag. The flowing stream of separated DNA fragments pass through or past the cleaving 
device 16 where the tag is removed for detection by electrochemical detection. In the exemplary embodiment, the 
cleaving device 16 is a photodeaving unit such that flowing stream of sample is exposed to selected light energy. In 

15 one embodiment, the sample enters the photodeaving unit 1 6 and is exposed to the selected light source for a selected 
duration of time. In an alternate embodiment, the flowing stream of sample is carried in a suitable tube portion or the 
like adjacent to the light source along a path that provides a sufficient exposure to the light source to deave the tags 
from the separated DNA fragments. 

[0415] A photodeaving unit is available from Supeico (Bellefonte, PA). Photodeaving can be performed at multiple 
20 wavelengths with a mercury/xenon arc lamp. The wavelength accuracy is about 2 nm with a bandwidth of 10 nm. The 
area irradiated is drcular and typically of an area of 1 0-1 00 square centimeters. In altemate embodiments, other deav- 
Ing devices, which deave by acid, base, oxidation, reduction, fluoride, thil exchange, photolysis, or enzymatic condi- 
tions, can be used to remove the tags from the separated DNA fragments. 

[0416] After the cleaving device 16 deaves the tags from the separated DNA fragments, the tags flow through a 
25 conventional tube 32 to the detection device 18 for detection of each tag. Detection of the tags is based upon the 
difference in electrochemical potential between each of the tags used to label each kind of DfMA generated In the PGR 
step. The electrochemical detector 18 can operate on either coulometric or amperometric principles. The preferred 
electromechanical detector 18 is the coulometric detector, which consists of a flow-through or porous-carbon graphite 
amperometric detector where the column eluent passes through the electrode resulting in 100% detection efficiency. 
30 To fully detect each component, an array of 1 6 coulometric detectors each held at a different potential (generally at 60 
mV increments) is utilized. The manufacturers of this type of detector include ESA (Bedford. MA) and Bioanalytical 
Systems Inc. (800-845-4246). 

[0417] In an altemate embodiment of the differential display system illustrated schematically in Figure 1 6, the sample 
introduction device 12, the separating device 14, and the deavage device 16 are serially connected as discussed 

35 above 'for maintaining the flow of sample. The deavage device 16 is connected to a detection device 18 that detects 
the tags based upon the difference in molecular weight between each of the tags used to label each kind of DNA 
generated in the PGR step. The best detector based upon differences in mass is the mass spectrometer 40, For this 
use, the mass spectrometer will typically have an atmospheric pressure ionization (API) interface with either electro- 
spray or chemical ionization, a quadrupole mass analyzer, and a mass range of at least 50 to 2600 m/z. Examples of 

40 manufacturers of a suitable mass spectrometer are: Hewlett Packard (Palo Alto, CA) HP 1 1 00 LC/MSD, Hitachi Instm- 
ments (San Jose. GA). M-1200H LC/MS. JEOL USA. Inc. (Peabody, MA). Perkin Elmer Corporation, Applied Biosys- 
terns Division (Foster City, CA) AP1 100 LG/MS or API 300 LG/MS/MS. Finnigan Corporation (San Jose. CA) LCQ, 
MAT 95 S. MAT 95 S Q, MAT 900 S. MAT. 900 S Q, and SSQ 7000, Bmker Analytical Systems, Inc. (Billerica. MA) 
APEX. BioAPEX. and ESQUIRE. 

45 [041 8] The detection device 1 8 is electrically connected to a data processor and analyzer 20 that receives data from 
the detection device. The data processor and analyzer 20 indudes a software program that identifies the detected tag 
and its position in the DNA sequence. The data processor and analyzer 20 in altemate embodiments is operatively 
connected to the injection device 12, the separation device 14, the fraction collector 15, and/or the deaving device 16 
to control the different components of the differential display system. 

50 [0419] The software package maps the signature of a given tag to a specific primer, and a retention time. Software 
generated nucleic acid profiles are then compared (length to length, fragment to fragment) and the results are reported 
to the user. The software will highlight both similarities and differences in the nucleic acid fragment profiles. The software 
will also be able to direct the collection of specific nudeic acid fragments by the fraction collector 15. 
[0420] The software package maps the m/z signature of a given tag to a specific primer, and a retention time. Software 

55 generated nucleic acid profiles are then compared (length to length, fragment to fragment) and the results are reported 
to the user. The software highlights both similarities and differences in the nudeic acid fragment profiles. The software 
is also able to direct the collection of spedfic nudeic acid fragments by the fraction collector. 
[0421] The differential display system is provided by operatively interconnecting the system's multiple components. 
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Accordingly, one or more system components, such as the sample introducing device 12 and the detecting device 18 
that are in operation in a lab can be combined with the system's other components, e.g., the separating device 14, 
cleaving device 1 6, and the data processor and analyzer 20, in order to equip the lab with a system in accordance with 
the present invention. 

5 [0422] Single nucleotide extension assay, oligo-ligation assay or oligonucleotide probe based assay systems of the 
present invenjion consist of, in generaj, a sample introduction device, a device to separate the tagged samples of 
interest, a device to cleave the tags from the samples oflntirestra device fbr~deTecting the tag, an<J a~sdftware program" " 
to analyze the data collected. It will be evident to one of ordinary skill in the art when in possession of the present 
disclosure that the general description may have many variances for each of the components listed. As best seen in 

10 Figure 1 7, (need Figure #) a preferred single-nucleotide extension assay, oligo-ligation assay or oligonudeotide-probe 
based assay system 200 consists of a sample introduction device 212. a separation device 214 that separates the 
samples by high-performance liquid chromatography, a cleaving device 216 to cleave the tags from the samples of 
interest, a detection device 218 of the tags by mass spectrometry, and a data processor and analyzer 220 which 
includes a software program. Each component is discussed in more detail below. 

^ 5 [0423] The sample introduction device 21 2 automatically takes a measured aliquot 222 of the nucleic acid fragment 
generated by a variety of methods (PGR. ligations, digestion, nucleases, etc.) and delivers it through a conventional 
tube 224 to the separation device 214 (generally an HPLC). The sample introduction device 212 of the exemplary 
embodiment consists of a temperature-controlled autosampler 226 that can accommodate micro-titer plates. The au- 
tosampler 226 must be temperature controlled to maintain the integrity of the nucleic acid samples generated and be 

20 able to inject 25 ^1 or less of sample. Manufacturers of this product are represented, for example, by Gilson (Middleton 
Wl). 

[0424] The sample introduction device is operatively connected in series to the separation device by the conventional 
tube 224, The nucleic acid products (which may be produced by PGR, ligation reactions, digestion, nucleases, etc.) in 
the measured aliquot 222 receive in the separation device 214 are separated temporally by high performance liquid 
25 chromatography. The high-perfomiance liquid chromatograph may have an isocratic, binary, or quatemary pump(s) 

227 and can be purchased from multiple manufacturers (e.g., Hewlett Packard (Palo Alto, GA) HP 1100 or 1090 series. 
Beckman Instruments Inc. (800-742-2345). Bioanalytical Systems, Inc. (800-845-4246), ESA, Inc. (508) 250-700). 
Pertcin-Elmer Gorp. (800-762-4000), Varian Instalments (800-926-3000), Waters Corp. (800-254-4752)). 

[0425] The separation device 214 includes an analytical HPLC column 228 suitable for use to separate the nucleic 
30 acid fragments. The column 228 is an analytical HPLC, for example, non-porous polystyrene divinylbenzene (2.2 \un 
particle size) solid support which can operate within a pH range of 2 to 1 2, pressures of up to 3000 psi and a temperature 
range of 10 to 70°G. A temperature-control device (e.g., a column oven) (not shown) may be used to control the 
temperature of the column. Such temperature-control devices are known in the art. and may be obtained from, for 
example, Rainin Instruments (subsidiary of Varian Instrument, Palo Alto, CA). A suitable column 228 is available under 
35 the commercial name of DNAsep® and is available from Serasep (San Jose, GA). A wide variety of HPLC columns 

228 can be used for this particular technological unit since single-base pair resolution is not necessarily required. Other 
suitable analytical HPLC columns are available from other manufacturers (e.g., Hewlett Packard (Palo Alto, CA). Beck- 
man Instmments. Inc. (Brea, GA), and Waters Gorp. (Milford, MA)). 

[0426] A stream of the separated DNA fragments (e.g., sequencing reaction product) flows through a conventional 
<0 tube 230 from the separation device 214 to the cleavage device 216. Each of the DNA fragments is labeled with a 
unique cleavable (e.g. , photocleavable) tag. The flowing stream of separated DNA fragments pass through or past the 
cleaving device 216 where the tag is removed for detection by mass spectrometry or with a electrochemical detector. 
The phbtodeavlng unit Is available from Supeico (Bellefonte, PA). Photodeaving can be perfonmed at multiple wave- 
lengths with a mercury/xenon arc lamp. The wavelength accuracy is about 2 nm with a bandwidth of 10 nm. The area 
45 irradiated is circular and typically of an area of 10-100 square centimeters. In alternate embodiments, other cleaving 
devices, which cleave by acid, base, oxidation, reduction, fluoride, thiol exchange, photolysis, or enzymatic conditions, 
can be used to remove the tags from the separated DNA fragments, 

[0427] After the deaving device 216 deaves the tags from the separated DNA fragments, the tags flow through a 
conventional tube 232 to the detection device 218 for detection of each tag. Detection of the tags can be based upon 

50 the difference in molecular weight between each of the tags used to label each kind of DNA generated in the various 
assay steps. The best detector based upon differences in mass is the mass spectrometer. For this use, the mass 
spectrometer will typically have an atmospheric pressure ionization (API) interface with either electrospray or chemical 
ionization, a quadrupole mass analyzer, and a mass range of at least 50 to 2600 m/z. Examples of manufacturers of 
a suitable mass spectrometer are: Hewlett Packard (Palo Alto, CA) HP 100 LG/MSD. Hitachi Instalments (San Jose, 

55 CA) M-1 200H LG/MS. Perkin Elmer Corporation, Applied Biosystems Division (Foster City, CA) AP1 1 00 LC/MS or API 
300 LC/MS/MS. Finnigan Corporation (San Jose. GA) LGQ. Bruker Analytical Systems. Inc. (Billerica, MA). ESQUIRE, 
and Micromers (U.K). 

[0428] In an alternate embodiment illustrated schematically in Figure 18. the sample introduction device 212. the 
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separating device 214, and the cleavage device 216 arc serially connected as discussed atx)ve for maintaining the 
flow of sample. The cleavage device 216 is connected to a detection device 218, which is an electrochemical detector 
240 that detects the tags based upon the difference in electrochemical potential between each of the tags used to label 
each kind of DNA generated in the sequencing reaction step. The electrochemical detector 240 of the exemplary 

5 emtXKiiment can operate on either coulometric or amperometric principles. The prefen-ed electrochemical detector 240 
is the coulometric detector, which consists of a flow-through or porous-carison graphite amperometric detector where 
the columrTeluent passes through the" electrode resulting in 1 00% detection efRciehcy. To fully detect each component, 
an array of 16 coulometric detectors each held at a different potential (generally at 60 mV increments) is utilized. 
Examples of manufacturers of this type of detector are ESA (Bedford, MA) and Bioanalytical Systems Inc. 

10 (800-845-4246). Additional manufacturers of electrochemical detectors can be found in the list of other manufacturers 
found below. 

[0429] The electrochemical detector 240 is electrically connected to the data processor and analyzer 220 with the 
software package discussed above. The software package maps the detected property (e.g., the mass or electrochem- 
ical signature) of a given tag to a specific sample ID. The software will be able to identify the nucleic acid fragment of 

IS interest and load the ID information into respective databases. 

[0430] The DNA analysis systems described herein have numerous advantages over the traditional gel based sys- 
tems. One of the principal advantages is that thiese systems may be fully automated. By utilizing an HPLC based 
separation system, samples can be automatically injected into the HPLC where as gel based systems require manual 
loading. There is also a significant time savings found in the set-up time (no gel forms to clean, no gel to pour), and 

20 the analysis time (greater than 4 hours for a large gel versus much shorter times (5 minutes to an hour) for an HPLC 
analysis Additionally, there is a sample throughput advantage. By utilizing the tags described in this invention, many 
samples can be analyzed in one batch (potentially 384 samples/lane) whereas the gel-based analyses are limited to 
the 4 fluorophores available or one sample/lane. The gels used are Inherently delicate and can easily break or contain 
an air bubble or other flaw rendering the whole gel or several lanes useless. HPLC columns are rugged and, when 

25 purchased pre-packed, are free of packing defects creating a consistent, generally uniform separation path. The HPLC 
systems also lend towards better quality assurance in that intemal standards can be utilized due to the reproducibility 
of the HPLC columns. Gel quality is inconsistent both between gels as well as within a gel making use of standards 
neariy impossible. Finally, both the mass spectrometry and electrochemical detectors are more sensitive than the 
detectors utilized in the gel based systems allowing for lower limits of detection and analysts of less sample which 

30 would be useful for non-PCR based analyses. 

Tagged Probes in Array-Based Assays 

[0431] Arrays with covalently attached oligonucleotides have been made used to perform DNA sequence analysis 
35 by hybridization (Southern et al., Genomics 13: 1008, 1992; Drmanac et al., Science 260: 1649, 1993), detemiine 
expression profiles, screen for mutations and the like. In general, detection for these assays uses fluorescent or radi- 
oactive labels. Fluorescent labels can be identified and quantitated most directly by their absorption and fluorescence 
emission wavelengths and Intensity: A microscope/camera setup using a fluorescent light source is a convenient means 
for detecting fluorescent label. Radioactive labels may be visualized by standard autoradiography, phosphor image 
40 analysis or CCD detector. For such labels the number of different reactions that can be detected at a single time is 
limited. For example, the use of four fluorescent molecules, such as commonly employed in DNA sequence analysis, 
limits anaylsis to four samples at a time. Essentially, because of this limitation, each reactk)n must be individually 
assessed when using these detector methods. 

[0432] A more advantageous method of detection allows pooling of the sample reactions on at least one an-ay and 
45 simultaneous detection of the products. By using a tag, such as the ones described herein, having a different molecular 
weight or other physical attribute in each reaction, the entire set of reaction products can be harvested together and 
analyzed. 

[0433] As noted above, the methods described herein are applicable for a variety of purposes. For example, the 

an-ays of oligonucleotides may be used to control for quality of making an-ays, for quantitation or qualitative analysis 
50 of nudeic acid molecules, for detecting mutations, for determining expression profiles, for toxicology testing, and the 
like. 

1. Probe quantitation or typing 

55 [0434] In this embodiment, oligonucleotides are immobilized per element in an array where each oligonucleotide in 
the element is a different or related sequence. Preferably, each element possesses a known or related set of sequences. 
The hybridization of a labeled probe to such an an-ay penmits the characterization of a probe and the identification and 
quantification of the sequences contained in a probe population. 
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[0435] A generalized assay format that may be used in the particular applications discussed below is a sandwich 
assay format. In this format, a plurality of oligonucleotides of known sequence are immobilized on a solid substrate. 
The immobilized oligonucleotide is used to capture a nucleic acid (e.g., RNA, rRNA. a PGR product, fragmented DNA) 
and then a signal probe is hybridized to a different portion of the captured target nucleic acid. 
5 [0436] Another generalized assay format is a secondary detection system. In this format, the arrays are used to 
identify and quantify labeled nucleic acids that have been used in a primary binding assay. For example, if an assay 
results in a labeled nudeic acid, the identity^ of that nucleic acid can be determined by hybridization to an array. These ~ 
assay fonmats are particularly useful when combined with cleavable mass spectometry tags. 

10 2. Mutation detection 

[0437] Mutations involving a single nucleotide can be identified in a sample by scanning techniques, which are suit- 
able to identify previously unknown mutations, or by techniques designed to detect, distinguish, or quantitate known 
sequence variants. Several scanning techniques for mutation detection have been developed based on the observation 
15 that heteroduplexes of mismatched complementary DNA strands, derived from wild type and mutant sequences, exhibit 
an abnormal migratory behavior. 

[0438] The methods described herein may be used for mutation screening. One strategy for detecting a mutation in 
a DNA strand is by hybridization of the test sequence to target sequences that are wild-type or mutant sequences. A 
mismatched sequence has a destabilizing effect on the hybridization of short oligonucleotide probes to a target se- 
20 quence (see Wetmur, Crit Rev. Biochem. Mol. Biol., 26:227, 1 991 ). The test nucleic acid source can be genomic DNA, 
RNA. cDNA, or amplification of any of these nucleic acids. Preferably, amplification of test sequences is first performed, 
followed by hybridization with short oligonucleotide probes immobilized on an array. An amplified product can be 
scanned for many possible sequence variants by determining its hybridization pattern to an array of immobilized oli- 
gonucleotide probes. 

25 [0439] A label, such as described herein, is generally incorporated into the final amplification product by using a 
labeled nucleotide or by using a labeled primer. The amplification product is denatured and hybridized to the an^y. 
Unbound product is washed off and label bound to the array is detected by one of the methods herein. For example, 
when cleavable mass spectrometry tags are used, multiple products can be simultaneously detected. 

30 3. Expression profiles I differential display 

[0440] Mammals, such as human beings, have about 1 00.000 different genes in their genome, of which only a small 
fraction, perhaps 15%, are expressed in any individual cell. Differential display techniques permit the identification of 
genes specific for individual cell types. Briefly, in differential display, the 3* terminal portions of mRNAs are amplified 
35 and identified on the basis of size. Using a primer designed to bind to the 5' boundary of a poly(A) tail for reverse 
transcription, followed by amplification of the cDNA using upstream arbitrary sequence primers, mRNA sub-populations 
are obtained. 

[0441] As disclosed herein, a high throughput method for measuring the expression of numerous genes (e.g., 1 -2000) 
is provided. Within one emt)odiment of the invention, methods are provided for analyzing the pattem of gene expression 

40 from a selected biological sample, comprising the steps of (a) amplifying cDNA from a biological sample using one or 
more tagged primers, wherein the tag is correlative with a particular nucleic add probe and detectable by non-fluores> 
cent spectrometry or potentiometry, (b) hybridizing amplified fragments to an array of oligonucleotides as described 
herein, (c) washing away non-hybridized material, and (d) detecting the tag by non-fluorescent spectrometry or poten- 
tiometry, and therefrom determining the pattem of gene expression of the biological sample. Tag-based differential 

45 display, especially using cleavable mass spectometry tags, on solid substrates allows characterization of differentially 
expressed genes. 

4. Single nucleotide extension assay 

50 [0442] The primer extension technique may be used for the detection of single nucleotide changes in a nucleic acid 
template (Sokolov, Nucleic Acids Res., 18:367^ , 1 989). The technique is generally applicable to detection of any single 
base mutation (Kuppuswamy et al., Proc. Natl, Acad. ScL USA, 88:1143-1147, 1991 ). Briefly, this method first hybridizes 
a primer to a sequence adjacent to a known single nucleotide polymorphism. The primed DNA is then subjected to 
conditions in which a DNA polymerase adds a labeled dNTP, typically a ddNTP, if the next base in the template is 

55 complementary to the labeled nucleotide in the reaction mixture. In a modification. cDNA is first amplified for a sequence 
of Interest containing a single-base difference between two alleles. Each amplified product is then analyzed for the 
presence, absence, or relative amounts of each allele by annealing a primer that is 1 base 5* to the polymorphism and 
extending by one labeled base (generally a dideoxynucleotide). Only when the correct base is available in the reaction 



74 



EP 0 990 047 B1 



will a base to incorporated at the 3*-end of the primer. Extension products are then analyzed by hybridization to an 
an-ay of o!igonucleotides such that a non-extended product will not hybridize. 

[0443] Briefly, in the present invention, each dideoxynucleotide is labeled with a unique tag. Of the four reaction 
mixtures, only one will add a dideoxy-terminatoron to the primer sequence. If the mutation is present, it will be detected 

5 through the unique tag on the dideoxynucleotide after hybridization to the array. Multiple mutations can be simultane- 
ously determined by tagging the DNA primer with a unique tag as well. Thus, the DNA fragments are reacted in four 
~ separate reactions each including a different tagged dideoxyterminator,- wherein the tag is correlative with a particular 
dideoxynucleotide and detectable by non-fluorescent spectrometry, or potentiometry. The DNA fragments are hybrid- 
ized to an array and non-hybridized material is washed away. The tags are cleaved from the hybridized fragments and 

10 detected by the respective detection technology (e.g., mass spectrometry, infrared spectrometry, potentiostatic am- 
perometry or UV/visible spectrophotometry). The tags detected can be correlated to the particular DNA fragment under 
investigation as well as the identity of the mutant nucleotide. 

5. Oligonucleotide ligation assay 

15 

[0444] The oligonucleotide ligation assay (OLA). (Landegen et al.. Science 24t:487. 1988) is used for the identifi- 
cation of known sequences in very large and complex genomes. The principle of OLA is based on the ability of ligase 
to covalently join two diagnostic oligonucleotides as they hybridize adjacent to one another on a given DNA target. If 
the sequences at the probe junctions are not perfectly based-paired, the probes will not be joined by the ligase. When 
20 tags are used, they are attached to the probe, which is ligated to the amplified product. After completion of OLA, 
fragments are hybridized to an array of complementary sequences, the tags cleaved and detected by mass spectrom- 
etry. 

[0445] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 

25 of oligonucleotide ligation assay. Briefly, such methods generally comprise the steps of performing amplification on the 
target DNA followed by hybridization with the 5' tagged reporter DNA probe and a 5' phosphorylated probe. The sample 
is incubated with T4 DNA ligase. The DNA strands with ligated probes are captured on the array by hybridization to 
an an^y. wherein non-ligated products do not hybridize. The tags are cleaved from the separated fragments, and then 
the tags are detected by the respective detection technology (e.g., mass spectrometry, infrared spectrophotometry. 

30 potentiostatic amperometry or UV/visible spectrophotometry. 

6. Other assays 

[0446] The methods described herein may also be used to genotype or identification of viruses or microbes. For 

35 example, F+ RNA coliphages may be useful candidates as indicators for enteric virus contamination. Genotyping by 
nucleic acid amplification and hybridization methods are reliable, rapid, simple, and inexpensive altematives to sero- 
typing (Kafatos et. al., Nucieic Acids Res. 7:1541, 1979). Amplification techniques and nucleic aid hybridization tech- 
niques have been successfully used to classify a variety of microorganisms including E. coli (Feng. Mol. Cell Probes 
7:151, 1993), rotavims (Sethabutr et, al., J. Med ViroL 37:192, 1992). hepatitis C virus (Stuyver et. al., J. Gen Virol. 

40 74:1093, 1993), and herpes simplex virus (Matsumoto et. al., J. ViroL Methods 40:119, 1992). 

[0447] Genefic alterations have been described in a variety of experimental mammalian and human neoplasms and 
represent the morphological basis for the sequence of morphological alterafions observed in carcinogenesis (Vogel- 
steln et al., NEJM 3f 9:525. 1988). In recent years with the advent of molecular biology techniques, allelic losses on 
certain chromosomes or mutation of tumor suppressor genes as well as mutations in several oncogenes (e.g., c-myc, 

45 c-jun, and the ras family) have been the most studied entities. Previous wori< (Finkelstein et al.. Arch Surg. 128:526, 
1993) has identified a correlation between specific types of point mutations in the K-ras oncogene and the stage at 
diagnosis in colorectal carcinoma. The results suggested that mutational analysis could provide important information 
of tumor aggressiveness. Including the pattern and spread of metastasis. The prognostic value of TP53 and K-ras-2 
mutational analysis in stage 111 carconoma of the colon has more recently been demonstrated (Pricolo et al., Am. J. 

50 Surg, f 71:41 , 1 996). It is therefore apparent that genotyping of tumors and pre-cancerous cells, and specific mutation 
detection will become increasingly important in the treatment of cancers in humans. 

Tagged Probes In Array-Based Assays 

55 [0448] The tagged biomolecules as disclosed herein may be used to Interrogate (untagged) arrays of biomolecules. 
Prefen-ed anciys of biomolcules contain a solid substrate comprising a surface, where the surface is at least partially 
covered with a layer of poly(ethylenimine) (PEl). The PEl layer comprises a plurality of discrete first regions abutted 
and surrounded by a contiguous second region. The first regions are defined by the presence of a biomolecule and 
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PEI. while the second region is defined by the presence of PEI and the substantial absence of the biomolecule. Pref- 
erably, the substrate is a glass plate or a silicon wafer. However, the substrate may be, for example, quartz, gold, nylon- 
6.6. nylon or polystyrene, as well as composites thereof, as described above. 

[0449] The PEI coating preferably contains PEI having a molecular weight ranging from 100 to 100,000. The PEI 
5 coating may be directly bonded to the substrate using, for example, siiylated PEL Alternatively, a reaction product of 
a bifunctionat coupling agent may be disposed between the substrate surface and the PEI coating, where the reaction 
product is oovalently bonded to both the surface and the PEI coating. and secures the PEI coating to the surface. The - 
bifunctional coupling agent contains a first and a second reactive functional group, where the first reactive functional 
group is, for example, a tri(0-Ci-C5alkyl)silane, and the second reactive functional group is, for example, an epoxide, 
10 isocyanate, isothiocyanate and anhydride group. Prefeaed bifunctional coupling agents include 2-(3.4-epoxycy- 
clohexyl)ethyltrimethoxysilane; 3,4-«poxybutyltrimethoxysilane; 3-isocyanatopropyltriethoxysilane, 3-(trlethoxysilyl)- 
2-methylpropylsuccinic anhydride and 3-(2,3-epoxypropoxy)propyltrimethoxysilane. 

[0450] The array of the invention contains first, biomolecule-containing regions, where each region has an area within 
the range of about 1 ,000 square microns to about 1 00,000 square microns. In a prefenred embodiment, the first regions 

15 have areas that range from about 5.000 square microns to about 25.000 square microns. 

[0451] The first regions are preferably substantially circular, where the circles have an average diameter of about 
10 microns to 200 microns. Whether circular or not, the boundaries of the first regions are preferably separated from 
one another (by the second region) by an average distance of at least about 25 microns, however by not more than 
about 1 cm (and preferably by no more than about 1 ,000 microns). In a prefen-ed array, the boundaries of neighboring 

20 first regions are separated by an average distance of about 25 microns to 100 microns, where that distance is preferably 
constant throughout the an^ay, and the first regions are preferably positioned in a repeating geometric pattern as shown 
in the Figures attached hereto. In a prefen-ed repeating geometric pattern, all neighboring first regions are separated 
by approximately the same distance (about 25 microns to about 100 microns). 

[0452] In preferred arrays, there are from 10 to 50 first regions on the substrate. In another embodiment, there are 
25 50 to 400 first regions on a substrate. In yet another preferred embodiment, there are 400 to 800 first regions on the 
substrate. 

[0453] The biomolecule located in the first regions is preferably a nucleic acid polymer. A preferred nucleic acid 
polymer is an oligonucleotide having from about 15 to about 50 nucleotides. The biomolecule may be amplification 
reaction products having from about 50 to atx>ut 1 ,000 nucleotides. 

30 [0454] In each first region, the biomolecule Is preferably present at an average concentration ranging from lO^ to 
10^ biomolecules per 2,000 square microns of a first region. More preferably, the average concentration of biomolecule 
ranges from 10^ to 10^ biomolecules per 2,000 square microns. In the second region, the biomolecule is preferably 
present at an average concentration of less than 10^ biomolecules per 2,000 square microns of said second region, 
and more preferably at an average concentration of less than 10^ biomolecules per 2.000 square microns. Most pref- 

35 erably. the second regions does not contain any biomolecule. 

[0455] The chemistry used to adhere the layer of PEI to the substrate depends, in substantial part, upon the chemical 
identity of the substrate. The prior art provides numerous examples of suitable chemistries that may adhere PEI to a 
solid support. For example, when the substrate Is nylon-6.6. the PEI coating may be applied by the methods disclosed 
in Van Ness. J. et al. Nudeic Acids Res. f 9:3345-3350. 1991 and PCT Intemational Publication WO 94/00600, both 

<o of which are incorporated herein by reference. When the solid support is glass or silicon, suitable methods of applying 
a layer of PEI are found in, e.g., Wassemian, B.P. Biotechnology and BioengineeringXXIl:27U2Q7, 1980; and D'Souza. 
S.F. Biotechnology Letters 8:643-648, 1986. 

[0456] Preferably the PEI coating is covalently attached to the solid substrate. When the solid substrate is glass or 
silicon, the PEI coating may be covalently bound to the substrate using silylating chemistry. For example. PEI having 

45 reactive siloxy endgroups is commercially available fi^om Gelest, Inc. (Tullytown. PA). Such reactive PEI may be con- 
tacted with a glass slide or silicon wafer, and after gentle agitation, the PEI will adhere to the substrate. Alternatively, 
a bifunctional silylating reagent may be employed. According to this process, the glass or silicon substrate is treated 
with the bifunctional silylating reagent to provide the substrate with a reactive surface. PEI Is then contacted with the 
reactive surface, and covalently binds to the surface through the bifunctional reagent. 

50 [0457] The biomolecules being placed into the array forniat are originally present in a so-called "arraying solution". 
In order to place biomolecule in discrete regions on the PEI-coated substrate, the an-aying solution preferably contains 
a thickening agent at a concentration of about 35 vol% to about 80 vol% based on the total volume of the composition, 
a biomolecule which is preferably an oligonucleotide at a concentration ranging from 0.001 fig/mL to 10 fig/mL. and 
water. 

55 [0458] The concentration of the thickening agent is 35% V/V to 80% VA/ for liquid thickening agents such as glycerol. 
The prefen-ed concentration of thickening agent in the composition depends, to some extent, on the temperature at 
which the arraying is performed. The lower the an-aying temperature, the lower the concentration of thickening agent 
that needs to be used. The combination of temperature and liquid thickening agent concentration control permits arrays 
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to be made on most types of solid supports (e.g.. glass, wafers, nylon 6/6, nylon membranes, etc.). 
(0459J The presence of a thickening agent has the additional benefit of allowing the concurrent presence of low 
concentrations of various other materials to be present in combination with the biomolecule. For example 0,001% V/ 
V to 1% VA/ of detergents may be present in the arraying solution. This is useful because PGR buffer contains a small 

5 amount of Tween-20 or NP-40, and it is frequently desirable to array sample nucleic acids directly from a PGR vial 
without prior purification of the amplicons. The use of a thickening agent permits the presence of salts (for example 
f^JaGI, KCI, or MgGl2), buffers~(for example Tris), and/or chelating reag"ents (for example EDTA) to also \5e present in 
the arraying solution. The use of a thickening agent also has the additional benefit of permitting the use of cross-linking 
reagents and/or organic solvents to be present in the an-aying solution. As commercially obtained, cross-linking rea- 

10 gents are commonly dissolved in organic solvent such as DMSO. DMF. NMP, methanol, ethanol and the like. Gommonly 
used organic solvents can be used in arraying solutions of the invention at levels of 0.05% to 20% (V/V) when thickening 
agents are used. 

[0460] In general, the thickening agents impart increased viscosity to the an-aying solution. When a proper viscosity 
is achieved in the arraying solution, the first drop is the substantially the same size as, for example, the 100th drop 

15 deposited. When an improper viscosity is used in the arraying solution, the first drops deposited are significantly larger 
than latter drops which are deposited. The desired viscosity is between those of pure water and pure glycerin. 
[0461] The biomolecule in the array may be a nucleic acid polymer or analog thereof, such as PNA, phosphorothioates 
and methylphosphonates. Nucleic acid refers to both ribonucleic acid and deoxyribonucleic acid. The biomolecule may 
comprise unnatural and/or synthetic bases. The biomolecule may be single or double stranded nucleic acid polymer. 

20 [0462] A preferred biomolecule is an nucleic acid polymer, which includes oligonucleotides (up to about 1 00 nucle- 
otide bases) and polynucleotides (over about 100 bases). A preferred nucleic add polymer is formed from 15 to 50 
nucleotide bases. Another prefen-ed nucleic acki polymer has 50 to 1,000 nucleotide bases. The nucleic acid polymer 
may be a PGR product. PGR primer, or nucleic add duplex, to list a few examples. However, essentially any nucleic 
add type can be covalently attached to a PEI-coated surface when the nucleic acid contains a primary amine, as 

25 disdosed below. The typical concentration of nudeic acid polymer in the arraying solution is 0.001 -1 O^g/mL, preferably 
0.01-1 fig/mL. and more preferably 0.05-0.5 ng/mL. 

[0463] Preferred nudeic add polymers are "amine-modified" in that they have been modified to contain a primary 
amine at the 5'-end of the nudeic add polymer, preferably with one or more methylene (-CH2-) groups disposed between 
the primary amine and the nudeic acid portion of the nucleic acid polymer. Six is a preferred number of methylene 

30 groups. Amine-modified nucleic acid polymers are prefen-ed because they can be covalently coupled to a solid support 
through the 5*-amine group. PGR products can be arrayed using 5-hexylamine modified PGR primers. Nudeic acid 
duplexes can be arrayed after the introduction of amines by nick translation using aminoallyl-dUTP (Sigma. St. Louis, 
MO). Amines can be introduced into nudeic acids by polymerases such as terminal transferase with amino allyl-dUTP 
or by ligation of short amine-containing nucleic acid polymers onto nucleic acids by ligases. 

35 [0464] Preferably, the nudeic acid polymer is activated prior to be contacted with the PEI coating. This can be con- 
veniently accomplished by combining amine-functionalized nudeic acid polymer with a multi-functional amine-reactive 
chemical such as trichlorotriazine. When the nucleic acid polymer contains a 5'-amine group, that 5'-amine can be 
reacted with trichlorotriazine, also known as cyanuric chloride (Van Ness et al.. Nudeic Acids Res. f 9f2;:3345-3350. 
1 991 ) Preferably, an excess of cyanuric chloride is added to the nudeic acid polymer solution, where a 1 0- to 1 000-fold 

40 molar excess of cyanuric chloride over the number of amines in the nucleic acid polymer in the arraying solution is 
prefen^ed. In this way, the majority of amine-terminated nucleic acid polymers have reacted with one molecule of trichlo- 
rotriazine. so that the nudeic acid polymer becomes terminated with dichlorotriazine. 

[0465] Preferably, the an^ying solution is buffered using a common buffer such as sodium phosphate, sodium borate, 
sodium carbonate, or Tris HGI. A prefen-ed pH range for the an-aying solution is 7 to 9, with a preferred buffer being 

45 freshly prepared sodium borate at pH 8.3 to pH 8.5. To prepare a typical an-aying solution, hexylamine-modified nudeic 
acid polymer is placed in 0.2 M sodium borate, pH 8,3, at 0.1 ^ig/mL. to a total volume of 50 ^il. Ten ^1 of a 15 mg/mL 
solution of cyanuric chloride is then added, and the reaction is allowed to proceed for 1 hour at 25 G with constant 
agitation. Glycerol (Gibco Brf®, Grand Island, NY) is added to a final concentration of 56%. 
[0466] The blomolecular arraying solutions may be applied to the PEI coating by any of the number of techniques 

50 currently used in microfabrication. For example, the solutions may be placed into an Ink jet print head, and ejected 
from such a head onto the coating. 

[0467] A preferred approach to delivering biomolecular solution onto the PEI coating employs a modified spring 
probe. Spring probes are available from several vendors induding Everett Gharies (Pomona, GA), Interconnect Devices 
Inc. (Kansas Gity, Kansas) and Test Gonnections Inc., (Upland, GA). In order for the commerdally available spring 
55 probes as described above to satisfactorily function as liquid deposition devices according to the present invention, 
approximately 1/1 000th to 5/1 000th of an inch of metal material must be removed from the tip of the probe. The process 
must result in a flat surface which is perpendicular to the longitudinal axis of the spring probe. The removal of approx- 
imately 1/1 000th to 5/1 000th of an inch of material firom the bottom of the tip is preferred and can be accomplished 
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easily with a very Tine grained wet stone. Specific spring probes which are commercially available and may be modified 
to provide a planar tip as described above include the XP54 probe manufactured by Ostby Barton (a division of Everett 
Charles (Pomona, CA)); the SPA 25P probe manufactured by Everett Charles (Pomona, CA) and 43-P fluted spring 
probe from Test Connections Inc., (Upland. CA). 
5 [0468] The arraying solutions as described above may be used directly in an an-aying process. That is, the activated 
nucleic acid polymers need not be purified away from unreacted cyanuric chloride prior to the printing step. Typically 
ttie reaction whidi attaches the activated nucleic acid to'the solid support is allowed to proceed for t to 20 hours at 20 
to 50 C. Preferably, the reaction time is 1 hour at 25 C. 

[0469] The arrays as described herein are particulariy useful in conducting hybridization assays, for example, using 
10 CMST labeled probes. However, in order to perform such assays, the amines on the solid support must be capped 
prior to conducting the hybridization step. This may be accomplished by reacting the solid support with 0.1-2.0 M 
succinic anhydride. The preferred reaction conditions are 1 .0 M succinic anhydride in 70% m-pyrol and 0.1 M sodium 
borate. The reaction typically is allowed to occur for 1 5 minutes to 4 hours with a prefen-ed reaction time of 30 minutes 
at 25 C. Residual succinic anhydride is removed with a 3x water wash. 
15 [0470] The solid support is then incubated with a solution containing 0.1-5 M glycine in 0.1-10.0 M sodium borate at 
pH 7- 9. This step "caps" any dichloro-triazine which may be covalently bound to the PEI surface by conversion into 
monochlorotriazine. The prefen-ed conditions are 0.2 M glycine in 0.1 M sodium borate at pH 8.3. The solid support 
may then be washed with detergent-containing solutions to remove unbound materials, for example, trace NMP Pref- 
erably, the solid support is heated to 95 C in 0.01 M NaCI, 0.05 M EDTA and 01 M Tris pH 8.0 for 5 minutes. This 
20 heating step removes non-covalently attached nucleic acid polymers, such as PCR products. In the case where double 
strand nucleic acid are arrayed, this step also has the effect of converting the double strand to single strand form 
(denaturation). 

[0471] The arrays are may be inten-ogated by probes (e.g., oligonucleotides, nucleic acid fragments, PCR products, 
etc.) which may be tagged with, for example CMST tags as described herein, radioisotopes, fluorophores or biotin. 
25 The methods for biotinylating nudeic acids are well known in the art and are adequately described by Pierce (Avidin- 
Biotin Chemistry: A Handtwok, Pierce Chemical Company, 1992, Rockford Illinois). Probes are generally used at 0.1 
ng/mL to lO/fxg/mL in standard hybridization solutions that include GuSCN, GuHCI, formamide, etc. {see Van Ness 
and Chen. Nucleic Adds Res.. 19:5143-5151, 1991). 

[0472] To detect the hybridization event (/'.e., the presence of the biotin), the solid support is incubated with strepta- 
30 vidln/horseradish peroxidase conjugate. Such enzyme conjugates are commercially available from, for example, Vector 
Laboratories (Burlingham, CA). The streptavidin binds with high affinity to the biotin molecule bringing the horseradish 
peroxidase into proximity to the hybridized probe. Unbound streptavidin/horseradish peroxidase conjugate is washed 
away in a simple washing step. The presence of horseradish peroxidase enzyme Is then detected using a precipitating 
substrate in the presence of peroxkJe and the appropriate buffers. 
35 [0473] A blue enzyme product deposited on a reflective surface such as a wafer has a many-fold lower level of 
detection (LLD) compared to that expected for a colorimetric substrate. Furthermore, the LLD is vastly different for 
different colored enzyme products. For example, the LLD for 4-methoxynapthol (which produces a precipitated blue 
product) per 50 diameter spot is approximately 1000 molecules, whereas a red precipitated substrate gives an LLD 
about 1000-fold higher at 1 ,000,000 molecules per 50 diameter spot. The LLD is determined by interrogating the 
40 surface with a microscope (such as the Axiotech microscope commercially available firom Zeiss) equipped with a visible 
light source and a CCD camera (Princeton Instruments. Princeton. NJ). An image of approximately 10.000 fiM x 10,000 
can be scanned at one time. 

[0474] In order to use the blue colorimetric detection scheme, the surface must be very dean after the enzymatic 
reaction and the wafer or slide must be scanned in a dry state. In addition, the enzymatic reaction must be stopped 

45 prior to saturation of the reference spots. For horseradish peroxidase this is approximately 2-5 minutes. 

[0475] It is also possible to use chemiluminescent substrates for alkaline phosphatase or horesradish peroxidase 
(HRP), or fluoroescence substrates for HRP or alkaline phosphatase. Examples include the dioxetane substrates for 
alkaline phosphatase available from Pericin Elmer or Attophos HRP substrate from JBL Scientific (San LajIs Obispo, CA). 
[0476] The following examples are offered by way of illustration, and not by way of limitation. 

50 [0477] Unless othenA^ise stated, chemicals as used In the examples may be obtained from Aldrich Chemical Com- 
pany. Milwaukee, Wl. The following abbreviations, with the indicated meanings, are used herein: 

ANP = 3-(Fmoc-amino)-3-(2-nitrophenyl)propionic acid 
NBA = 4-(Fmoc-aminomethyl)-3Hiitrobenzoic acid 
55 HATU = 0-7-azaben2otriazol-1-yl-N,N.N*,N -tetramethyluronium hexafluorophosphate 

DIEA = diisopropylethylamine 
MCT = monochlorotriazine 
NMM = 4-methylmorphoIine 
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NMP = N-methylpyrrolidone 

ACT357 = ACT357 peptide synthesizer from Advanced ChemTech. Inc.. Louisville, KY 
ACT = Advanced ChemTech, Inc., Louisville, KY 
NovaBiochem = CalBiochem-NovaBiochem International, San Diego. CA 
5 TFA = Trifluoroacetic acid 

Tfa = Trifluoroacetyl 

INTP = N-Methylisonipecotic~acid" - - - _ 

Tfjp = Tetrafluorophenyl 
DIAEA = 2-(Diisopropylamino)ethylamine 
10 MCT = monochlorotriazene 

5'-AH-0DN = 5'-anninohexyl-tailed oligodeoxynucleotide 

EXAMPLES 

15 EXAMPLE I 

PREPARATION OF ACID LABILE LINKERS FOR USE IN CLEAVABLE-MW-IDENTIFIER SEQUENCING 

A. Synthesis of Pentafluorophenyl Esters of Chemically Cleavable Mass Spectroscopy Tags, to Liberate Tags with 
2° Caft)oxyl Amide Tenfnini 

[0478] Figure 1 shows the reaction scheme. 

Step A. TentaGel S AC resin (compound 11; available from ACT; 1 eq.) is suspended with DMF in the collection 
25 vessel of the ACT357 peptide synthesizer (ACT). Compound I (3 eq.), HATU (3 eq.) and DIEA (7.5 eq.) in DMF 

are added and the collection vessel shaken for 1 hr. The solvent is removed and the resin washed with NMP (2X), 
MeOH {2X). and DMF {2X). The coupling of I to the resin and the wash steps are repeated, to give compound 111. 

Step B. The resin (compound III) is mixed with 25% piperidine in DMF and shaken for 5 min. The resin is filtered, 
30 then mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed, the resin washed with 

NMP (2X), MeOH (2X), and DMF (2X). and used directly in step C, 

Step C. The deprotected resin from step B is suspended in DMF and to it Is added an FMOC-protected amino 
acid, containing amine functionality in its side chain (compound IV. e.g., alpha-N-FMOC-3-(3-pyridyl}-alanine. avail- 
35 able from Synthetech, Albany, OR; 3 eq.), HATU (3 eq.). and DIEA (7.5 eq.) in DMF. The vessel is shaken for 1 

hr. The solvent is removed and the resin washed with NMP (2X), MeOH (2X). and DMF (2X). The coupling of IV 
to the resin and the wash steps are repeated, to give compound V. 

Step D. The resin (compound V) is treated with piperidine as described in step B to remove the FMOC group. The 
^0 deprotected resin is then divided equally by the ACT357 from the collection vessel into 16 reaction vessels. 

Step E . The 16 aliquots of deprotected resin from step D are suspended in DMF. To each reaction vessel is added 
the appropriate carboxylic acid Vl^.^g (RM6CO2H; 3 eq.), HATU (3 eq.), and DIEA (7.5 eq.) in DMF. The vessels 
are shaken for 1 hr. The solvent is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and 
^5 DMF (2X). The coupling of VI ^.^^ to the aliquots of resin and the wash steps are repeated, to give compounds Vll^.^g. 

Step F . The aliquots of resin (compounds Vll^.^g) are washed with CH2CI2 (3X). To each of the reaction vessels 
is added 1% TFA in CH2CI2 and the vessels shaken for 30 min. The solvent is filtered from the reaction vessels 
Into individual tubes. The aliquots of resin are washed with CH2CI2 (2X) and MeOH (2X) and the filtrates combined 
50 into the individual tubes. The individual tubes are evaporated in vacuo, providing compounds Vlll^.^g. 

Step G. Each of the free carboxylic acids Vllli.^g dissolved in DMF. To each solution is added pyridine (1.05 
eq.), followed by pentafluorophenyl trifluoroacetate (1.1 cq.). The mixtures are stirred for 45 min, at room temper- 
ature. The solutions are diluted with EtOAc. washed with 1 M aq. citric acid (3X) and 5% aq. NaHCOa (3X). dried 
55 over Na2S04. filtered, and evaporated in vacuo, providing compounds IX^.^g. 
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A. Synthesis of Pentafiuorophenyt Esters of Chemically Cleavabie Mass Spectroscopy Tags, to Liberate Tags with 
Carboxyl Acid Termini 

[0479] Figure 2 shows the reaction scheme. 

5 

, Step A. 4-(Hydroxymethyl)phenoxybutyric acid (compound I; 1 eq.) is combined with DIEA (2.1 eq.) and ally! bro- 
mide (2.1 eq.) in CHCI3 and heated to reflux for 2 hr. The~mixture is diluted with EtOAc", washed with 1 N HCI (2X),- - 
pH 9.5 carlx)nate buffer (2X), and brine (IX), dried over Na2S04, and evaporated in vacuo to give the allyl ester 
of compound 1. 

10 

Step B. The allyt ester of compound 1 from step A (1 .75 eq.) is combined in CH2CI2 with an FMOC-protected amino 
acid containing amine functionality in its side chain (compound II. e.g.. alpha-N-FMOC-3-(3-pyridyl)-alanine, avail- 
able from Synthetech, Albany, OR; 1 eq.). N-methylmorpholine (2.5 eq.), and HATU (1.1 eq.), and stirred at room 
temperature for 4 hr. The mixture is diluted with CH2CI2. washed with 1 M aq. citric acid (2X), water (IX), and 5% 
15 aq, NaHC03 (2X), dried over Na2S04, and evaporated in vacuo. Compound III is isolated by flash chromatography 

(CH2Cl2-> EtOAc). 

Step C . Compound III is dissolved in CH2CI2, Pd(PPh3)4 (0.07 eq.) and N-methylaniline (2 eq.) are added, and 
the mixture stirred at room temperature for 4 hr. The mixture is diluted with CH2CI2. washed with 1 M aq. citric acid 
20 (2X) and water (1X). dried over Na2S04, and evaporated in vacuo. Compound iV is isolated by flash chromatog- 

raphy (CH2Cl2-> EtOAc + HOAc). 

Step D . TentaGel S AC resin (compound V; 1 eq.) is suspended with DMF in the collection vessel of the ACT357 
peptide synthesizer (Advanced ChemTech Inc. (ACT), Louisville, KY). Compound IV (3 eq.). HATU(3 eq.) and 
25 DIEA (7.5 eq.) in DMF are added and the collection vessel shaken for 1 hr. The solvent is removed and the resin 

washed with NMP (2X). MeOH (2X). and DMF (2X). The coupling of IV to the resin and the wash steps are repeated, 
to give compound VI. 

Step E. The resin (compound VI) is mixed with 25% piperidine in DMF and shaken for 5 min. The resin is filtered, 
30 then mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed and the resin washed with 

NMP (2X), MeOH (2X). and DMF (2X). The deprotect^d resin is then divided equally by the ACT357 from the 
collection vessel into 16 reaction vessels. 

Step F. The 16 aliquots of deprotected resin from step E are suspended in DMF. To each reaction vessel is added 
35 the appropriate carboxylic acid Vll^.^g (^1-16^02^^ 3 eq.). HATU (3 eq.), and DIEA (7.5 eq.) in DMF. The vessels 

are shaken for 1 hr. The solvent is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and DMF 
(2X). The coupling of VII^.^q to the aliquots of resin and the wash steps are repeated, to give compounds Vllt^.i^. 

Step G . The aliquots of resin (compounds Vlllt.is) are washed with CH2CI2 (3X). To each of the reaction vessels 
^ is added 1% TFA in CH2CI2 and the vessels shaken for 30 min. The solvent is filtered from the reaction vessels 

into individual tubes. The aliquots of resin are washed with CH2CI2 (2X) and MeOH (2X) and the filtrates combined 
into the individual tubes. The individual tubes are evaporated in vacuo, providing compounds IX^.^g. 

Step H . Each of the free carboxylic acids 1X^.^3 is dissolved in DMF. To each solution is added pyridine (1 .05 eq.). 
45 followed by pentafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stirred for 45 min. at room temperature. 

The solutions are diluted with EtOAc, washed with 1 M aq. citric acid {3X) and 5% aq. NaHCOa (3X), dried over 
Na2S04. filtered, and evaporated in vacuo, providing compounds X^.^g. 

EXAMPLE 2 

50 

DEMONSTRATION OF PHOTOLYTIC CLEAVAGE OF T-L-X 

[0480] A T-L-X compound as prepared in Example 1 1 was in-adiated with near-UV light for 7 min at room temperature, 
A Rayonett fluorescence UV lamp (Southern New England Ultraviolet Co.. Middletown. CT) with an emission peak at 
55 350 nm is used as a source of UV light. The lamp is placed at a 15-cm distance from the Petri dishes with samples. 
SDS gel electrophoresis shows that >85% of the conjugate is cleaved under these conditions. 
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EXAMPLE 3 

PREPARATION OF FLUORESCENT LABELED PRIMERS AND DEMONSTRATION OF CLEAVAGE OF 
FLUOROPHORE (does not relate to claimed subject-matter) 

5 

_ , Synthesis and Purification of Oligonudeotides 



[0481] The oiigonucleotides (ODNs) are prepared on automated DNA synthesizers using the standard phosphora< 
midite chemistry supplied by the vendor, or the H-phosphonate chemistry (Glenn Research Sterling, VA). Appropriately 

10 blocked dA« dG, dC. and T phosphoramidites are commercially available in these forms, and synthetic nucleosides 
may readily be converted to the appropriate form. The oligonucleotides are prepared using the standard phosphora- 
midite supplied by the vendor, or the H-phosphonate chemistry. Oligonucleotides are purified by adaptations of standard 
methods. Oligonucleotides with 5'-trityl groups are chromatographed on HPLC using a 12 micrometer. 300 # Rainin 
(Emeryville. CA) Dynamax C-8 4,2x250 mm reverse phase column using a gradient of 15% to 55% MeCN in 0.1 N 

^5 Et3NH*0Ac-, pH 7.0, over 20 min. When detritylation is perfonmed, the oligonucleotides are further purified by gel 
exclusion chromatography. Analytical checks for the quality of the oligonudeotides are conducted with a PRP-column 
(Alltech, Deerfield, IL) at alkaline pH and by PAGE. 

[0482] Preparation of 2,4.6-trichlorotriazine derived oligonucleotides: 10 to 1000 ^ig of 5'-tenminal amine linked oli- 
gonudeotide are reacted with an excess recrystallized cyanuric chloride in 10% n-methyl-pyn-olidone in alkaline (pH 
20 8.3 to 8.5 preferably) buffer at IS^'C to 25*^0 for 30 to 120 minutes. The final reaction conditions consist of 0.15 M 
sodium t)orate at pH 8.3, 2 mg/ml recrystallized cyanuric chloride and 500 ug/ml respective oligonudeotide. The un- 
reacted cyanuric chloride Is removed by size exdusion chromatography on a G-50 Sephadex (Pharmacia, Piscataway, 
NJ) column, 

[0483] The activated purified oligonudeotide is then reacted with a 100-fold molar excess of cystamine in 0.15 M 
25 sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted cystamine is removed by size exclusion chro- 
matography on a G-50 Sephadex column. The derived ODNs are then reacted with amine-reactive fluorochromes. 
The derived ODN preparation is divided into 3 portions and each portion is reacted with either (a) 20-fold molar excess 
of Texas Red sulfonyl chloride (Molecular Probes. Eugene, OR), with (b) 20-fold molar excess of Lissamine sulfonyl 
chloride (Molecular Probes, Eugene, OR), (c) 20-fold molar excess of fluorescein isothiocyanate. The final reaction 
30 conditions consist of 0.15 M sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted fluorochromes 
are removed by size exdusion chromatography on a G-50 Sephadex column. 

[0484] To cleave the fluorochrome from the oligonudeotide, the ODNs are adjusted to 1 x 1 0~^ molar and then dilutions 
are made (12, 3-fold dilutions) in TE (TE is 0.01 M Tris, pH 7.0, 5 mM EDTA). To 100 jil volumes of ODNs 25 \i\ of 0.01 
M dithiothreitol (DTT) is added. To an identical set of controls no DDT is added. The mixture is incubated for 15 minutes 

35 at room temperature. Fluorescence is measured in a black miaotiter plate. The solution is removed from the incubation 
tubes (1 50 microliters) and placed in a black microtiter plate (Dynatek Laboratories, Chantilly, VA). The plates are then 
read directly using a Fluoroskan II fluorometer (Flow Laboratories, McLean, VA) using an excitation wavelength of 495 
nm and monitoring emission at 520 nm for fluorescein, using an excitatbn wavelength of 591 nm and monitoring 
emission at 612 nm for Texas Red, and using an excitation wavelength of 570 nm and monitoring emission at 590 nm 

^0 for lissamine, with the results set forth in TABLE 1 . 



TABLE 1 



45 



50 



55 



Moles of Ruorochrome 


RFU non-deaved 


RFU deaved 


RFU free 


1.0x105M 


6.4 


1200 


1345 


3.3x 106M 


2.4 


451 


456 


1,1x106M 


0.9 


135 


130 


3.7x107M 


0-3 


44 


48 


1.2x lO^M 


0.12 


15.3 


16.0 


4.1 X lO^M 


0.14 


4.9 


5.1 


1,4x108M 


0,13 


2.5 


2.8 


4.5 x109m 


0.12 


0.8 


0,9 



[0485] The data indicate that there is about a 200-fold inaease in relative fluorescence when the fluorochrome is 
deaved from the ODN. 



81 



EP 0 990 047 B1 



EXAMPLE 4 

PREPARATION OF TAGGED Ml 3 SEQUENCE PRIMERS AND DEMONSTRATION OF CLEAVAGE OF TAGS 

[0486] Preparation of 2,4,6-trichlorotria2ine derived oligonucleotides: 1000 of 5'-terminal amine linked oligonu- 
cleotide (5'-hexylamine-TGTAAAACGACGGCCAGT-3") (Seq. ID No. 1) are reacted with an excess recrystallized cy- 
anufic chloride in" 10% n-methyl^jyrrolidone aIkaline'(pH 8.3 to 8:5 preferably) buffer at- 19 to 25- C for 30 to 120 
nfilnutes. The final reaction conditions consist of 0.15 M sodium borate at pH 8.3. 2 mg/ml recrystallized cyanuric 
chloride and 500 ug/m! respective oligonucleotide. The unreacted cyanuric chloride is removed by size exclusion chro- 
matography on a G-50 Sephadex column. 

[0487] The activated purified oligonucleotide is then reacted with a 100-fold molar excess of cystamine in 0.15 M 
sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted cystamine is removed by size exclusion chro- 
matography on a G-50 Sephadex column. The derived ODNs are then reacted with a variety of amides. The derived 
ODN preparation is divided into 12 portions and each portion is reacted (25 molar excess) with the pentafluorophenyl- 
esters of either: (1 ) 4-methoxybenzoic acid, (2) 4-fluorobenzoic acid, (3) toluic acid, (4) benzoic acid, (5) indole-3-acetic 
acid, (6) 2,6-difIuorobenzoic acid. (7) nicotinic acid N-oxide, (8) 2-nitrobenzoic acid, (9) 5-acetylsalicylic acid, (10) 
4-ethoxybenzoic acid, (11) cinnamic acid, (12) 3-aminonicotinic acid. The reaction is for 2 hours at 37°C in 0.2 M 
NaBorate pH 8.3. The derived ODNs are purified by gel exclusion chromatography on G-50 Sephadex. 
[0488] To cleave the tag from the oligonucleotide, the ODNs are adjusted to 1 x 10-^ molar and then dilutions are 
made (12, 3-fold dilutions) in TE (TE is 0.01 M Tris, pH 7.0, 5 mM EDTA) with 50% EtOH (VA/). To 100 ^il volumes of 
ODNs 25 ^li of 0.01 M dithiothreitol (DTT) is added. To an identical set of controls no DDT is added. Incubation is for 
30 minutes at room temperature. NaCi is then added to 0.1 M and 2 volumes of EtOH is added to precipitate the ODNs. 
The ODNs are removed from solution by centrifugation at 14.000 x G at 4''C for 15 minutes. The supematants are 
reserved, dried to completeness. The pellet is then dissolved in 25 ^1 MeOH. The pellet is then tested by mass spec- 
trometry for the presence of tags. 

[0489] The mass spectrometer used in this wojk is an extemal ion source Fourier-transform mass spectrometer 
(FTMS). Samples prepared for MALDl analysis are deposited on the tip of a direct probe and inserted into the ion 
source. When the sample is irradiated with a laser pulse, ions are extracted from the source and passed Into a long 
quadrupoie ion guide that focuses and transports them to an FTMS analyzer cell located InskJe the bore of a super- 
conducting magnet. 

[0490] The spectra yield the following information. Peaks varying in intensity from 25 to 100 relative intensity units 
at the following molecular weights: (1) 212.1 amu indicating 4-methoxybenzoic acid derivative, (2) 200.1 indicating 
4-fluorobenzoic acid derivative, (3) 196.1 amu indicating toluic acid derivative, (4) 182.1 amu indicating benzoic acid 
derivative. (5)235.2 amu Indicating indole-3-acetlc add derivative, (6)218.1 amu indicating 2,6-difluorobenzoic deriv- 
ative, (7)199.1 amu indicating nicotinic acid N-oxide derivative. (8) 227.1 amu indicating 2-nitrobenzamide, (9) 179.18 
amu indicating 5-acetylsalicylic acid derivative, (10) 226.1 amu indicating 4-ethoxybenzoic acid derivative, (11) 209.1 
amu indicating cinnamic acid derivative. (12) 198.1 amu indicating 3-aminonicotinic acid derivative. 
[0491] The results indicate that the MW-ldentifters are cleaved from the primers and are detectable by mass spec- 
trometry. 

EXAMPLES 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri^-LYS(e-INIP)-ANP-TFP 

[0492] Figure 3 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^), where is an activated 
ester (specifically, tetrafluorophenyl ester), L^ is an ortho-nitrobenzylamine group with L^ being a methylene group that 
links and L^, T has a modular structure wherein the carboxylic acid group of lysine has been joined to the nitrogen 
atom of the L^ benzylamine group to forni an amide bond, and a variable weight component R1.36, (where these R 
groups correspond to T^ as defined herein, and may be introduced via any of the specific carboxylic acids listed herein) 
is tx}nded through the a-amino group of the lysine, while a mass spec sensitivity enhancer group (introduced via N- 
methylisonipecotic acid) is bonded through the e-amino group of the lysine. 
[0493] Referring to Figure 3: 

Step A . NovaSyn HMP Resin (available from NovaBiochem; 1 eq.) is suspended with DMF in the collection vessel 
of the ACT357. Compound I (ANP available from ACT; 3 eq.). HATU (3 eq.) and NMM (7.5 eq.) In DMF are added 
and the collection vessel shaken for 1 hr. The solvent is removed and the resin washed with NMP (2X). MeOH 
(2X), and DMF (2X). The coupling of I to the resin and the wash steps are repeated, to give compound II. 
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Step B . The resin (compound H) is mixed with 25% piperidine in DMF and shaken for 5 min. The resin is filtered, 
then mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed, the resin washed with 
NMP (2X). MeOH (2X). and DMF (2X). and used directly in step C. 

Step C. The deprotected resin from step B is suspended in DMF and to it is added an FMOC-protected amino 
_ acid, containing a prpte_cted_aniine functionality In its side chain (Fmoc-Lyslne(Aloc)-OH, available from PerSeptive 
Biosystems; 3 eq.). HATU (3 eq.). and NMM (7,5 eq.) in OMR The vessells shaken for 1 hr. The solvent is removed 
and the resin washed with NMP (2X), MeOH (2X), and DMF (2X). The coupling of Fmoc-Lys(Aloc)-OH to the resin 
and the wash steps are repeated, to give compound IV. 

Step D . The resin (compound IV) Is washed with CH2CI2 (2X), and then suspended in a solution of (PPh3)4Pd (0) 
(0.3 eq.) and PhSiHj (10 eq.) in CH2CI2. The mixture is shaken for 1 hr. The solvent is removed and the resin is 
washed with CH2CI2 (2X). The palladium step is repeated. The solvent is removed and the resin is washed with 
CH2CI2 (2X). N.N-dilsopropylethylammonlum diethyldithiocarbamate in DMF (2X). DMF (2X) to give compound V. 

Step E . The deprotected resin from step D is coupled with N-methylisonipecotic acid as described in step C to 
give compound VI. 

Step F . The Fmoc protected resin VI is divided equally by the ACT357 from the collection vessel into 36 reaction 
vessels to give compounds Vl^^. 

StepG. The resin (compounds VI, ^g) *s treated with piperidine as described in step B to remove the FMOC group. 

Step H : The 36 aliquots of deprotected resin from step G are suspended in DMF. To each reaction vessel is added 
the appropriate carboxylic acid (Ri^C02H; 3 eq.). HATU (3 eq.), and NMM (7,5 eq.) in OMR The vessels are 
shaken for 1 hr. The solvent Is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and DMF 
(2X). The coupling of Ri.^6C02^ aliquots of resin and the wash steps are repeated, to give compounds 

Vllli.36- 

Step 1. The aliquots of resin (compounds Vlllt.36) are washed with CH2CI2 (3X). To each of the reaction vessels 
is added 90:5:5 TFA:H20:CH2Cl2 and the vessels shaken for 120 min. The solvent is filtered from the reaction 
vessels Into individual tubes. The aliquots of resin are washed with CH2CI2 (2X) and MeOH (2X) and the filtrates 
combined into the individual tubes. The individual tubes are evaporated /n vacuo, providing compounds 1X1.35. 

Step J. Each of the free carboxylic acids 1X^.3^ is dissolved in DMF. To each solution is added pyridine (1 .05 eq.), 
followed by tetrafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stin-ed for 45 min. at room temperature. 
The solutions are diluted with EtOAc. washed with 5% aq. NaHCOa (3X), dried over Na2S04, filtered, and evap- 
orated in vacuo, providing compounds X^^^q- 

EXAMPLES 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri.36-LYS(e-lNIP)-NBA-TFP 

[0494] Figure 4 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^), where L^ is an activated 
ester (specifically, tetrafluorophenyl ester), L^ is an ortho-nitrobenzylamine group with L^ being a direct bond between 
Lh and L2, where L^, is joined directly to the aromatic ring of the L^ group, T has a modular structure wherein the 
cari>oxylic acid group of lysine has been joined to the nitrogen atom of the L^ benzylamine group to fomi an amide 
bond, and a variable weight component R^.^, (where these R groups correspond to T^ as defined herein, and may 
be introduced via any of the specific cart>oxylic acids listed herein) is bonded through the a-amino group of the lysine, 
while a mass spec enhancer group (introduced via N-methylisonipecotic acid) is bonded through the e-amino group of 
the lysine. 

[0495] Referring to Figure 4 

Step A . NovaSyn HMP Resin Is coupled with compound I (NBA prepared according to the procedure of Brown et 
al., Molecular Diversity, 1 , 4 (1 995)) according to the procedure described in step A of Example 5, to give compound 
II. 

Steps B-J . The resin (compound II) is treated as described in steps B-J of Example 5 to give compounds X^.3g. 
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EXAMPLE 7 



PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA INIP-LYS (e-Ri.36)-ANP-TFP 

5 [0496] Figure 5 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^). where is an activated 
ester (specifically, tetrafluprgphenyl esjer), L2 is an ortho-nitrobenzylamine group with being a methylene group that 
links Lh and L^, T has a modular structure wherein the cartxjxylic acid group of lysine has been joined to the nitrogen" " 
atom of the benzylamine group to fonm an amide bond, and a variable weight component R,.36. (where these R 
groups conrespond to as defined herein, and may be introduced via any of the specific carboxylic acids listed herein) 

10 is bonded through the E-amino group of the lysine, white a mass spec sensitivity enhancer group (introduced via 
methylisonipecotic acid) is bonded through the a-amino group of the lysine. 
[04971 Referring to Figure 5: 

Steps A-C . Same as in Example 5. 

15 

Step D . The resin (compound IV) is treated with piperidine as described in step B of Example 5 to remove the 
FMOC group. 

Step E . The deprotected a*amine on the resin in step D is coupled with N-methylisonipecotic add as described in 
20 step C of Example 5 to give compound V. 

Step R Same as in Example 5. 

Step G. The resin (compounds Vl^^g) ^^e treated with palladium as described in step D of Example 5 to remove 
25 the Aloe group. 

Steps H-J . The compounds are prepared in the same manner as in Example 5. 

EXAMPLES 

30 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri.36-GLU(y-DIAEA)-ANP-TFP 

[0498] Figure 6 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^), where L^ is an activated 
ester (specifically, tetrafluorophenyl ester), L^ is an orthonrjitrobenzylamine group with L^ being a methylene group that 

35 links Lj, and L^, T has a modular structure wherein the a-carboxylic acid group of glutamatic acid has been joined to 
the nitrogen atom of the L^ benzylamine group to fonn an amide bond, and a variable weight component R^^q, (where 
these R groups correspond to T^ as defined herein, and may be introduced via any of the specific carboxylic acids 
listed herein) is bonded through the aa-amino group of the glutamic acid, while a mass spec sensitivity enhancer group 
(introduced via 2-(diisopropylamino)ethylamine) is bonded through the y<»ri3oxylic acid of the glutamic acid. 

40 [0499] Referring to Figure 6: 

Steps A-B. Same as in Example 5. 

Step C . The deprotected resin (compound III) is coupled to Fmoc-Glu-(OAI)-OH using the coupling method de- 
45 scribed in step C of Example 5 to give compound IV. 

Step D . The allyl ester on the resin (compound IV) is washed with CH2CI2 (2X) and mixed with a solution of 
(PPh3)4Pd (0) (0.3 eq.) and N-methylaniline (3 eq.) in CH2CI2. The mixture is shaken for 1 hr. The solvent is 
removed and the resin is washed with CH2CI2 (2X). The palladium step is repeated. The solvent is removed and 
50 the resin is washed with CH2CI2 (2X). N,N-diisopropylethylammonium dielhyldithiocartjamate in DMF (2X), DMF 

(2X) to give compound V. 

Step E. The deprotected resin from step D is suspended In DMF and activated by mixing HATU (3 eq.). and NMM 
(7,5 eq.). The vessels are shaken for 15 minutes. The solvent is removed and the resin washed with NMP (1X). 
55 The resin is mixed with 2-(diisopropylamino)ethylamine (3 eq.) and NMM (7.5 eq.). The vessels are shaken for 1 

hour. The coupling of 2-(diisopropylamino)ethylamtne to the resin and the wash steps are repeated, to give com- 
pound VI . 
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Steps F-J . Same as in Example 5. 
EXAMPLE 9 

5 PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri^6-LYS{E-INfP)'ANP-LYS(e-NH2)-NH2 

(0500] Figure 7 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^), where" L^ an amine 
(specifically, the e-amino group of a lysine-derived moiety), L^ is an ortho-nitrobenzylamine group with L^ being a 
cart)Oxamidosubstituted alkyleneaminoacylalkylene group that links and L^, T has a modular structure wherein the 

10 carboxylic acid group of lysine has been joined to the nitrogen atom of the L^ benzylamine group to form an amide 
bond, and a variable weight component R1.36. (where these R groups correspond to T^ as defined herein, and may 
be introduced via any of the specific carboxylic acids listed herein) is bonded through the a-amino group of the lysine, 
while a mass spec sensitivity enhancer group (introduced via N-methyiisonipecotic acid) is bonded through the e-amino 
group of the lysine. 

15 [0501] Referring to Figure 7: 

Step A . Fmoc-Lys(Boc)-SRAM Resin (available from ACT; compound I) is mixed with 25% piperidine in DMF and 
shaken for 5 min. The resin is filtered, then mixed with 25% piperidine in DMF and shaken for 10 min. The solvent 
is removed, the resin washed with NMP (2X). MeOH (2X). and DMF (2X), and used directly in step B. 

20 

Step B. The resin (compound II), ANP (available from ACT; 3 eq.). HATU (3 eq.) and NMM (7,5 eq.) in DMF are 
added and the collection vessel shaken for 1 hr. The solvent is removed and the resin wasfied with NMP (2X), 
MeOH (2X), and DMF (2X). The coupling of I to the resin and the wash steps are repeated, to give compound III. 

Steps C^ . The resin (compound III) is treated as in steps B-1 in Example 5 to give compounds X^^. 

EXAMPLE 10 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri^6-*-YS(e-TFA)-LYS(e-IINP)-ANP-TFP 

30 

[0502] Figure 8 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^,). where is an activated 
ester (specifically, tetrafluorophenyl ester), L^ is an ortho-nitrobcnzylamine group with L^ being a methylene group that 
links l_h and L^, T has a modular structure wherein the carboxylic acid group of a first lysine has been joined to the 
nitrogen atom of the L^ benzylamine group to fomi an amide bond, a mass spec sensitivity enhancer group (introduced 
35 via N-methylisonipecotic acid) is bonded through the e-amino group of the first lysine, a second lysine molede has 
been joined to the first lysine through the a-amino group of the first lysine, a molecular weight adjuster group (having 
a trifluoroacetyl structure) is t)onded through the e-amino group of the second lysine, and a variable weight component 
R^.36, (where these R groups correspond to T^ as defined herein, and may be introduced via any of the specific car- 
bo)^ic acids listed herein) is bonded through the a-amino group of the second lysine. Referring to Figure 8: 

40 

Steps A-E. These steps are identical to steps A-E in Example 5. 

Step F . The resin (compound VI) Is treated with piperidine as described in step B in Example 5 to remove the 
FMOC group. 

45 

Step G. The deprotected resin (compound VII) is coupled to Fmoc-Lys(Tfa)-OH using the coupling method de- 
scribed in step C of Example 5 to give compound VIII. 

Steps H-K. The resin (compound VIII) is treated as in steps F-%1 in Example 5 to give compounds Xl^^. 

50 

EXAMPLE 11 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri.36-LYS(e-INIP)-ANP-5'-AH-ODN 

55 [0503] Figure 9 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = MOI, where MOl is a nucleic 
acid fragment, ODN) derived from the esters of Example 5 (the same procedure could be used with other T-L-X com- 
pounds wherein X is an activated ester). The MOl is conjugated to T-L through the 5' end of the MOl, via a phosphodi- 
ester - alkyleneamine group. 
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[0504] Referring to Figure 9: 

Step A. Compounds XII 1.36 are prepared according to a modified biotinylation procedure in Van Ness et al., Nucleic 
Acids Res., 19, 3345 (1991). To a solution of one of the 5'-aminohexyl oligonucleotides (compounds Xl^^g, 1 mg) 
5 in 200 mM sodium borate (pH 8.3, 250 mL) is added one of the Tetrafluorophenyl esters (compounds X^,26 ^o"^ 

Example A. 100-fold molar excess in 250 mL of NMP). The reaction is incubated overnight at ambient temperature. 
Tlie uni^eacted and hydrolyzed tetrafluorophenyl esters are removed from the compounds Xll,:36 by Sephadex G- 
50 chromatography. 

10 EXAMPLE 12 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Ri.36-LYS(e-INIP)-ANP-LYS(e-(MCT-5'-AH-ODN)) 
-NH2 

IS [0505] Figure 10 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = MOI, where MOI is a nucleic 
acid fragment, ODN) derived from the amines of Example 11 (the same procedure could be used with other T-L-X 
compounds wherein X is an amine). The MOI is conjugated to T-L through the 5' end of the MOI, via a phosphodiester 
- alkyleneamine group. 
[0506] Referring to Figure 1 0: 

20 

Step A. The 5 -[6-(4,6-dichloro-1 ,3,5-triazin-2-ylamino)hexyl]oligonucleotides XII^^q ^re prepared as described in 
Van Ness et al.. Nucleic Acids Res., 19, 3345 (1991). 

Step B. To a solution of one of the 5'-[6-(4,6-dichloro-1 ,3,5-triazin-2-ylamino)hexyl]oligonucleotides (compounds 
25 Xlli.36) at a concentration of 1 mg/ml in 1 00 mM sodium borate (pH 8.3) was added a 1 00-fold molar excess of a 

primary amine selected from Ri.36-Lys(e-iNIP)-ANP-Lys(e-NH2)-NH2 (compounds X^.36 from Example 11). The 
solution is mixed overnight at ambient temperature. The unreacted amine is removed by ultrafiltration through a 
3000 MW cutoff membrane (Amicon. Beverly. MA) using H2O as the wash solution (3 X). The compounds Xlll^^g 
are isolated by reduction of the volume to 100 mL. 

30 

EXAMPLE 13 

DEMONSTRATION OF THE SIMULTANEOUS DETECTION OF MULTIPLE TAGS BY MASS SPECTROMETRY . 

35 [0507] This example provides a description of the ability to simultaneously detect multiple compounds (tags) by mass 
spectrometry. In this particular example, 31 compounds are mixed with a matrix, deposited and dried on to a solid 
support and then desorbed with a laser. The resultant ions are then introduced in a mass spectrometer 
[0508] The following compounds (purchased from Aldrich, Milwaukee, Wl) are mixed together on an equal molar 
basis to a final concentration of 0.002 M (on a per compound) basis: benzamide (121.14), nicotinamide (122.13), 

40 pyrazinamide (123,12), 3-amino-4-pyrazolecarboxylic acid (127.10), 2-thiophenecarboxamide (127.17), 4-aminoben- 
zamide (135.15), tolumide (135.17), 6-me|thylnicotinamide (136.15), 3-aminonicotinamide (137.14), nicotinamide N- 
oxide (138.12), 3-hydropicolinamide (138.13), 4-fluorobenzamide (139.13), cinnamamide (147.18), 4-methoxybenza- 
mide (151.17), 2,6-difluorbenzamide (157.12). 4-amino-5-imidazole-carboxyamide (162.58), 3,4-pyridine-dicarboxya- 
mide (165.16). 4-ethoxybenzamide (165.19). 2.3-pyrazinedicarboxamide (166.14), 2-nitrobenzamide (166.14), 

45 3-fluoro-4Hnethoxybenzoic acid (170.4), jndole-3-acetamide (174.2), 6-acetylsalicylamide (179.18). 3,5-dimethoxy- 
benzamide (181 .19), 1-naphthaleneacetamide (185.23), 8-chloro-3,5-diamino-2-pyrazinecartDoxyamide (187.59), 4-tri- 
fluoromethyl-benzamide (189.00), 5-amino-5-phenyl-4-pyrazole-carboxamide (202,22), 1-methyl-2-benzylmalona- 
mate (207.33), 4-amino-2,3.5,6-tetrafluorobenzamide (208.11). 2,3-napthlenedicart>oxylic acid (212.22). The com- 
pounds are placed in DMSO at the concentration described above. One ^1 of the material is then mixed with alpha- 

50 cyano-4-hydroxy cinnamic acid matrix (after a 1:10.000 dilution) and deposited on to a solid stainless steel support. 
[0509] The material is then desorbed by a laser using the Protein TOF Mass Spectrometer (Bruker, Manning Parte, 
MA) and the resulting ions are measured in both the linear and reflectron modes of operation. The following m/z values 
are observed (Figure 11 ): 

55 121.1— > benzamide (121,14) 

122.1 — > nicotinamide (122.13) 

123.1 — > pyrazinamide (123.12) 
124.1 
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2,3-napthlenedicarboxylic acid (212.22). 



[0510] The data indicate that 22 of 31 compounds appeared in the spectrum with the anticipated mass, 9 of 31 
compounds appeared in the spectrum with a n + H mass (1 atomic mass unit, amu) over the anticipated mass. The 
latter phenomenon is probably due to the protonation of an amine within the compounds. Therefore 31 of 31 compounds 
are detected by MALDI Mass Spectroscopy. More importantly, the example demonstrates that multiple tags can be 
detected simultaneously by a spectroscopic method. 

[0511] The alpha-cyano matrix alone (Figure 11) gave peaks at 146.2. 164.1, 172.1. 173.1, 189.1, 190.1, 191,1, 
192.1 . 212.1 . 224.1 , 228.0, 234.3. Other identified masses in the spectrum are due to contaminants in the purchased 
compounds as no effort was made to further purify the compounds. 

EXAMPLE 14 

MICROSATELLITE MARKERS: PCR AMPLIFICATIONS. 

[051 2] The microsatellite markers are amplified utilizing the following standard PCR conditions. Briefly, PCR reactions 
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are performed in a total volume of 50 jil, containing 40 ng of genomic DNA, 50 pmol of each primer, 0.125 mM dNTPs 
and I unit of Taq polymerase. IX amplification buffer contains 1 0 mM Tris base. pH 9, 50 mM KCI, 1 .5 mM MgCl2, 0. 1 % 
Triton X-100 and 0.01% gelatin. The reactions are performed using a "hot-start" procedure: Taq polymerase is added 
only after a first denaturation step of 5 minutes at OO^C. Amplification is carried out for 35 cycles: denaturation {94*'C 
5 for 40 sec) and annealing (55''C for 30 sec). An elongation step (72''C for 2 minutes) ends the process after the last 
annealing. Since the amplification products to be obtained are short (90 to 350 base pairs long) and the time interval 
to raise" the temperature from 55°C to 94°C"(obtained with a rarnpihg~rale of 1 *C/sec6n(i) is long enough, completion 
of DNA elongation can be achieved without a step at 72^C. 

10 EXAMPLE 15 

SEPARATION OF DNA FRAGMENTS 

Instrumentation 

15 

[051 3] The separation of DNA fragments can be performed using an HPLC system assembled from several standard 
components. These components include a minimum of two high pressure pumps which pump solvent through a high 
pressure gradient mixer, an injector, HPLC column, and a detector. The injector is an automated, programmable au- 
tosampler capable of storing typically between eighty and one hundred samples at or below ambient temperatures to 

20 maintain the stability of the sample components. The autolnjector also Is capable of making uL size injections in a 
reproducible manner completely unattended. The HPLC column Is contained in a heated column compartment capable 
of holding a defined temperature to within 0.1 °C. The column used In the examples below was purchased from SeraSep 
(San Jose, CA) under the name DNASep. This column is a 55x4.6 mm column with a 2.2um non-porous polystyren- 
edivinylbenzene copolymer particle alkylated with C18. The packing material is stable within a pH range of 2-12 and 

25 tolerates temperatures as high as lO^'C. Detection of analyte was perfomied using a single or multiple wavelength UV 
detector or diode array detector. 

Methods 

30 [0514] The methods applied in this example for separation of DNA fragments use ion-pair chromatography, a form 
of chromatography in which ions in solution can be paired or neutralized and separated as an ion pair on a reversed 
phase column. The lipophilic character and the concentration of the counterion determine the degree of retention of 
the analyte. In the case of a DNA molecule the lipophilic, cationic buffer component pairs with anionic phosphate groups 
of the DNA backbone. The buffer components also interact with the alkyi groups of the stationary phase. The paired 

35 DNA then elutes according to size as the mobile phase is made progressively more organic with increasing concen- 
tration of acetonitnle. Evaluation of the suitability of various amine salts was evaluated using enzymatic digests of 
plasmids or commercially available DNA ladders. The range of acetonitrile required to elute the DNA as well as the 
temperature of the column compartment varied with each buffer evaluated. 

40 Buffers 

[051 5] The buffers evaluated for their ion-pairing capability were prepared from stock solutions. In order to keep the 
concentration of ion-pair reagent the same throughout the gradient, the ion-pair reagent was added to both the water 
and the acetonitrile mobile phases. The column was equilibrated with a new mobile phase for approximately 18 hours 
45 at a flow rate of 50ul/minute before attempting any separation. Once a mobile phase had been evaluated, it was re- 
moved and the column cleaned with a flush of 800 mL 0.1% formic acid in 50% acetonitrile. followed by a flush with 
800 mL 0.1% acetic acid in 50% acetonitrile before equilibration with a new mobile phase. 

A. nn-Dimethyloctylammonium trifluoroacetate 

50 

[0516] A stock solution of 1 molar dimethyloctylammonium trifluoroacetate was prepared by mixing one half of an 
equivalent of trifluoroacetic acid in an appropriate volume of water and slowly adding one equivalent of nn-Dimethyl- 
octylamine. The pH of this stock solution Is 7. The stock solution was diluted with an appropriate volume of water or 
acetonitrile to woricing concentration. 

55 

B. nn-Dimethylheptylammonium acetate 

[051 7] A stock solution of 1 molar dimethylheptylammonium acetate was prepared by mixing one equivalent of glacial 
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acetic acid in an appropriate volume of water and slowly adding one equivalent of nn-Dimethylheptylamine. The pH of 
this stock solution is 6.6. The stock solution was diluted with an appropriate amount of water or acetonitrile to worthing 

concentration. 

5 C. nn-Dimethylhexylammonium acetate 

[0518] A stock solution of 1 molar dimethylhexylammoriium acetate was prepared by mixing one equivalent of glacial ~ " 
acetic acid in an appropriate volume of water and slowly adding one equivalent of nn-Dimethylhexylamine, The pH of 
this stock solution Is 6.5. The stock solution was diluted with an appropriate volume of water or acetonitrile to woricing 
10 concentration. 

D. nn-Dimethylbutylammonium acetate 

[0519] A stock solution of 1 molar dimethylbutylammonium acetate was prepared by mixing one equivalent of glacial 
15 acetic acid in an appropriate volume of water and slowly adding one equivalent of nn-Dimethylbutylamine. The pH of 
the stock solution is 6.9. The stock solution was diluted with an appropriate volume of water or acetonitrile to woricing 
concentration. 

E. nn-Dimethylisopropylammonium acetate 

20 

[0520] A stock solution of 1 molar dim ethyl isopropylammonium acetate was prepared by mixing one equivalent of 
glacial acetic acid in an appropriate volume of water and slowly adding one equivalent of nn-Dimethylisopropylamine. 
The pH of the stock solution is 6.9. The stock solution was diluted with an appropriate volume of water or acetonitrile 
to working concentration. 

25 

F. nn-Dimethylcyclohexylammonium acetate 

[0521] A stock solution of 1 molar dimethylcydohexylammonium acetate was prepared by mixing one equivalent of 
glacial acetic acid in appropriate volume of water and slowly adding one equivalent of nn-Dimethylcyclohexylamine, 
30 The pH of the stock solution is 6.5. The stock solution was diluted with an appropriate volume of water or acetonitrile 
to working concentration. 

G. Methylpiperidine acetate 

35 [0522] A stock solution of 1 molar methylpiperidine acetate was prepared by mixing one equivalent of glacial acetic 
acid in an appropriate volume of water and slowly adding one equivalent of 1 -methylpiperidine. The pH of the solution 
is 7. The stock solution was diluted with an appropriate volume of water or acetonitrile to working concentration. 

H. Methylpyrrolidine acetate 

40 

[0523] A stock solution of 1 molar piperidine acetate was prepared by mixing one equivalent of glacial acetic acid in 
an appropriate volume of water and slowly adding one equivalent 1 -methylpyrrolidine. The pH of the stock solution is 
7. The stock solution was diluted in an appropriate volume of water or acetonitrile to working concentration. 

45 I. Triethylammonium acetate 

[0524] A stock solution of 2 molar triethylammonium acetate pH 7.0 was purchased from Glenn Research Steriing, 
Virginia. The stock solution was diluted in an appropriate volume of water or acetontrile to working concentration. 

50 EXAMPLE 16 

DNA FINGERPRINT 

[0525] DNA fingerprinting adaptors are prepared comprising the following: a core sequence and an enzyme specific 

55 sequence. The structure of the EcoR1-adapter is 5 -CTCGTAGACTGCGTACC (SEQ II) No. ). the stnjcture of 

the Mse1 -adapter is: 5"-GACGATGAGTCCTGAG 

[0526] Adapters for the rare cutter enzymes were identical to the EcoR1 with the exception that cohesive ends were 
used. ALPH primers consists of three parts: a core sequence, an enzyme specific sequence and a selective extension 
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sequence. The EcoR^ and Mse^ primers are described as follows: EcoRI: 5'-gactgcgtaaa-aattc-NNN (SEQ ID No. J; 
Mse1: 5'-gatgagtcclgag-taa-NNN (SEQ ID No. ). 

[0527] Genomic DNA was incubated for 1 hour at ZJ^'C with 5 units EcoRI and 5 units of Msel in 40 p.1 volumes with 
10 mM Tris-acetate pH 7.5, 10 mM MgAce, 50 mM KAcetate, 5 mM DTT. 50 ng/microHter BSA, 5 mM DTT. Next. 10 
5 >il of a solution containing 5 pMol EcoRI adapters, 50 pMol A^sel adapters,, 1 unit of T4 tigase, 1 mM AIR in 10 mlVt 
Tris-acetate pH 7.5, 10 mM MgAce. 50 mM KAcetate, 5 mM DTT, 50 ng/microliter BSA was added and the incubation 
was continued fofS hours at'37**C' Adapters were prepared by adding eqiiimolar amounts of both strands: adapters 
were not phosphorylated. After ligation, the reaction mixture was diluted to 500 ^il with 10 mM Tris HCI, 0.1 mM EDTA 
pH8.0 and stored at -20°C. 

10 [0528] Genetic fingerprinting reactions: Amplification reactions are described using DNA templates for the enzyme 
combination EcoRyMsel Genomic fingerprints with other enzyme combinations were performed with appropriate prim- 
ers. The amplification reactions generally employed two oligonucleotides, one corresponding to the EcoRI pends and 
one corresponding to the Afsel -ends. One of the two primers was labelled with the CMST tag, preferably the ECORI 
primer The PGR s were performed using 5 ng labeled EcoRI primer, 30 ng Msel primer, 5 microliters of template DNA. 

15 OA units Taq polymerase, 1 0 mM Tris-HCI, pH 8.3. 1 .5 mM MgCL2.50 mM KCI, 0.2 mM of dATP. dGTP. dCTP, dTTR 
The PGR reactions differed depending on the nature of the selective amplification extensions of the DNA fingerprinting 
primers used for amplification. DNA fingerprinting reactions with primers having two two or three selective nucleotides 
were performed for 36 cycles with the following cyde profile: a 30 second DNA denaturation step, at 94**G. a 30 second 
annealing step at 55**C, and then a 1 minute extension step at 72°G for 1 minute. The annealing temperature in the 

20 first step was 65°G and was subsequently reduce for each cycle step by 0.7*^C in the next 1 2 cycles and was continued 
at 56° G for the remaining 23 cycles. All amplifications were performed in a an MJ thermocyder (Watertown MA). 
[0529] DNA fingerprinting of the complex genomes (such as humans) involve two amplification steps. The pream- 
plification was performed with two DNA fingerprinting having a single selective nudeotide as described at)ove with the 
exception that 30 ng of t>oth DNA fingerprinting primers was used and that these primers were not labelled with GMST, 

25 after the preamplification step, the reaction mixtures were diluted 10-fold with with 10 mM Tris-HGI, 0.1 mM EDTA. 
pH 8.0, and used as templates for the second amplification reaction. The second amplification reaction was performed 
as described above for DNA fingerprinting reactions with primers having the longer selective extensions. 
[0530] The products from the amplification reactions were analyzed by HPLG. HPLC was carried out using automated 
HPLC instrumentation (Rainin. Emeryville, CA., or, Hewlett Packard, Palo Alto, CA+). Unpurified DNA fingerprinting 

30 products which had been denatured for 3 minutes at 95 prior into injection into an HPLC were eluted with linear ace- 
tonitrile (AGN, J.T Baker, NJ) gradient of 1.8%/minute at a flow rate of 0.9 ml/minute, the start and end points were 
adjusted according to the size of the amplified products. The temperature required for the successful resolution of the 
molecules generated during the DNA fingerprinting technique was 50°G. The effluent from the HPLC was then directed 
into a mass spectrometer (Hewlett Packard, Palo Alto, GA) for the detection of tags. 

35 [0531] The following fragments eluted in the order presented (The number sited are the positions within the lambda 
genome at which a deavage site occun-ed): 47, 78. 91, 733. 1456. 2176, 3275, 3419, 4349, 444, 5268. 5709, 6076, 
6184. 6551. 7024. 7949, 8062. 8200. 8461. 9079. 9253, 9692. 9952. 11083. 11116. 11518. 11584. 12619. 12967, 
14108. 14892, 15628, 15968. 16034, 16295, 16859. 18869. 19137, 19482, 20800, 21226, 21441 . 2635, 21702, 21903. 
21948. 22724, 23048, 23084. 23111 , 23206. 23279, 23285, 23479, 23498, 23555, 23693, 23887, 23979, 23987, 24073, 

40 24102, 24751 , 24987, 25170, 25255. 25353. 25437, 26104, 25578, 25746, 25968. 26133. 26426. 26451 . 26483, 26523. 
26585. 26651 . 26666. 26679. 26693, 26763, 2681 0. 26984. 26993, 27038. 27092. 27203. 2731 7. 27683, 28456. 28569. 
28922, 28972. 29374, 29981 . 30822. 30620. 30639, 30722, 30735, 30756. 311 69, 31 747. 31 808. 321 94. 3221 8. 32641 . 
32704. 33222, 33351 , 33688. 33736, 33748. 33801 . 34202, 34366. 34406. 34590, 3461 8. 34684. 34735. 34753. 34831 , 
35062. 35269. 35534, 35541 , 36275, 36282, 36303. 36430. 36492, 36531 , 36543, 36604, 36736, 36757. 36879. 37032. 

45 37442. 37766. 37783. 37882, 3791 6, 37994, 361 64. 38287. 3841 2, 38834. 39 1 68. 44972, 39607, 39835. 401 27. 40506, 
40560. 40881 , 41 01 7, 41 423, 41 652, 41 71 5, 4231 7. 42631 . 42651 . 42673, 4281 4, 4341 0, 43492. 43507, 43528. 43593, 
44424. 44538. 44596. 44868. 451 51 .45788, 46033. 46408. 46556, 46804, 46843, 46853, 46896, 46952, 47256. 47274. 
47287. 47430. 47576. 47699. 47799, 48059, 48125, 48227. 48359. 48378. The average fragment length was about 
160 nt. The observed sites of deavage were largely (>95%) compatible with that predicted from an MSEI/RcoRI digest 

50 map. 

EXAMPLE 17 

SINGLE NUCLEOTIDE EXTENSION ASSAYS 

55 

[0532] RNA preparation: Total RNA was isolated was prepared from Jurt^at cells using (starting with 1x10^ cells in 
exponential growth) using an RNA isolation kit from Promega (Wl). RNA was stored in two aliquots: 1) stock aliquote 
In diethyl pyrocarbonate-treated ddH20 was stored at -20°C. and 2) long temri storage as a suspension in 100% H20. 
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[0533] Reverse Transcription: Poly(dT) primed reverse transcription of total RNA was performed as described as 
described in Ausubel et al. (Ausubel et al., in Cuaent Protocols in Molecular Biology, 1991, Greene Publishing Asso- 
ciatesA/Viley-lnterscience. NY, NY.) except that the reaction(s) were scaled to using 1 ^g of input total RNA. 20-50 units 
of reverse transcriptase (Promega) was diluted 10-fold in 10% glycerol, 10 mM KP04 pH 7,4. 0.2% Triton X-100. and 

5 2 mM DTT and placed on ice for 30 minutes prior to addition to the reactions. Gene-specific reverse transcription for 
G ADPH and other control genes as described below were performed using 1 of total Jurkat RNA reversed transcribed 

in 10 mM'Tris-HCI pH*8.3, 50 mM KCI. M MgCL2.^1 mMUNTPs. 2 U/fil RNAsih (Gibco-BRL)r0:1 ^M oligomer and 

0.1 25 U/^l of M-MLV reverse transcriptase (Gibco-BRL) in 20^1 reactions. Reactions were incubated in at 42°C for 1 5 
minutes, heat inactivated at 95°C for 5 minutes , and diluted to 100 \i\ with a master mix of (10 mM Tris HCI pH 8.3, 1 

10 mM NH4CI. 1 .5 M MgC1 2, 1 00 mM KCI). 0. 1 25 mM NTPs. 1 0 ng/ml of the respective oligonucleotide primers and 0.75 
units of TAQ polymerase (Gibco-BRL) in preparation for PGR amplification. 

[0534] PGR: PGR for each gene was performed with gene specific primers spanning a known intron/exon boundry 
(see below). All PCRs were done in 20|xl volumes containing 10 mM Tris HCI pH 8.3. 1 mM NH4G1, 1.5 M MgGI2, 100 
mM KCI), 0.125 mM NTPs, 10 ng/ml of the respective oligonucleotide primers and 0.75 units of TAQ polymerase 
15 (Gibco-BRL). Cycling parameters were 94''C preheating step for 5 minutes followed by 94''C denaturing step for 1 
minute. 55^G annealing step for 2 minutes, and a 72''G extension step for 30 seconds to 1 minute and a final extension 
at 72''C for 10 minutes. Amplifications were generally 30-45 in number. 

[0535] Purification of templates: PGR products were gel purified as described by Zhen and Swank (Zhen and Swank. 

BioTechniques, t4(6):894-898. 1993). PGR products were resolved on 1% agarose gels run in 0.04 M Tris-acetate, 
20 0.001 M EDTA (1x TEA) buffer and stained with ethidium bromide while visualizing with a UV light source. A trough 

was cut just in front of the band of interest and filled with 50-200 ^1 of 1 0% PEG in 1 x TAE buffer. Electrophoresis was 

continued until the band had completely entered the trough. The contents was then removed and extracted with phenol, 

cholorfonn extracted, and then precipitated in 0.1 volume of 7.5 M ammonium acetate and 2.5 volumes of 1 00% EtOH. 

Samples were washed with 75% EtOH and briefly dried at ambient temperature. Quantitation of yield was done by 
25 electrophoresis of a small aliquot on 1% agarose gel in 1x TBE buffer with ethidium bromide staining and comparison 

to a known standard. 

[0536] Each SNuPE reaction was canned out in a 50 ^1 volume containing about 1 00 ng of the amplified DN A fragment. 
1 |iM of the SNuPE primer, 2 units of Tag polymerase, and 1 ul of the appropriate dNTP. All dNTPs are unlabelled in 
this type of assay. The buffer used was 10 mM Tris-HCI (pH 8.3), with 50 mM KCI, 5mM MgCI2 and 0,001% (wt/vol) 
30 gelatin. The samples were subjected to one cycle consisting, of a 2-minute denaturation period at 95°C, a 2 minute 
annealing period at eO^'C and a 2-minute primer extension period at 72'*G. The sequence of the SNUPE primer for 
each family is described below. 

[0537] Primer extensions: Single nucleotide primer extensions were performed as described in Singer-Sam et al.. 
(Singer-Sam et al., PCR Methods and Applications t:160-163, 1992) except that 1 mM Mg++, 0.1 ^M primer, and 0.05 
35 ^M of each dNTP type was used in each reaction type. After each primer extension described above, one-fifth volume 
of a loading dye (80% fonnamide. 0.1 % bromophenol blue, 0.1 % xylene cyanol, 2 mM EDTA) was added, and the 
entire sample electrophoresed in 15% denaturing polyacrylamide gel. Gels were fixed in 10% glycerol, 10% methanol. 
10% glacial acetic acid with constant shaking followed by washing steps with 10% glycerol. The gels were then dried 
at 55°C for 3-5 hours. 

40 [0538] The primers described in this experiment are described by Rychlik (Rychlik, BioTechniques 1 8:84-90. 1 995) 
Primers may be synthesized or obtained as gel-flirtation grade primers from Midland Certified Reagent Company (Mid- 
land Texas). The amplifications are either TAQ DNA polymerase-based (10 mM Tris-HCI pH 8.3. 1.5 mM MgGI2, 50 
mM KGI) or Pfu DNA polymerase-based based (20 mM Tris-HCI pH 8.3, 2.0 mM MgGI2. 10 mM KCI. 10 mM (NH4) 
2S04, 0.1% Triton X-100, 0.1 mg/ml bovine senjm albumin). The total nucleoside triphosphate (NTPs) concentration 

<5 in the reactions is 0.8 mM, the primer concentration is 200 nM (unless othenwise stated) and the template amount is 
0.25 ng of bacteriophage lambda DNA per 20 ^1 reaction. Cycling parameters were 94°C preheating step for 5 minutes 
followed by 94**C denaturing step for 1 minute. 55°C annealing step for 2 minutes, and a 72**C extension step for 30 
seconds to 1 minute and a final extension at 72^C for 10 minutes. Amplifications were generally 30-45 in number 
[0539] Two regions in the bacteriophage lambda genome (GenBank Accession #J02459) were chosen as the priming 

50 sites for amplification. The 5'-primer has a stable GC-rich 3'-end: the 3' primer is chosen so that a 381 bp product will 

result. The 5' forward primer is H17: 5'-GAACGAAAACCCCCCGC (SEQ ID No. ). The 3'-reverse primer is RP17: 

5'-GATCGCCCCCAAAACACATA (SEQ ID No. ). 

[0540] The amplified product was then tested for the presence of a polymorphism at position 31 245. The following 

primer was used in four single nucleotide extension assays; SNE1 7: 5'-GAACGAAAACCCCCCGC (SEQ ID No. ). 

55 The four single nucleotide extension assays were then carried as described above. All the reactions are then pooled 
and 5 ^1 of the pooled material was injected onto the HPLC column (SeraSep, San Jose, CA) without further purification. 
[0541] HPLC was carried out using automated HPLC instrumentation (Rainin, Emeryville. CA.. or. Hewlett Packard, 
Palo Alto, CA). Unpurified SNEA products which had been denatured for 3 minutes at 95 prior into injection into an 
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HPLC were eluted with linear acetonitrile (ACN, J.T. Baker. N J) gradient of 1 .8%/minute at a flow rate of 0.9 ml/minute, 
The start and end points were adjusted according to the size of the SNEA product. The temperature required for the 
successful resolution of the SNEA molecules was 50°C. The effluent from the HPLC was then directed into a mass 
spectrometer (Hewlett Packard, Palo Alto. CA) for the detection of tags, with the results set forth in TABLE 2, 



TABLE 2 



Tagged Primer 


ddNTP type 


retention time 


extended? 


SNE17^87 


ddATP 


2.5 minutes 


no 


SNE1 7-496 


ddGTP 


2.5 minutes 


no 


SNE1 7-503 


ddCTP 


4.6 minutes 


yes 


SNE1 7-555 


ddTTP 


2.5 minutes 


no 



[0542] The results therefore indicate that the mass spectrometer tag (CMST) tag was detected at a retention time 
of 4.6 minutes indicating that the SNE17 primer was extended by one base (ddCTP) and therefore the polymorphism 
was position 31245 was in this case a "G". The SNE1 7-487, SNE1 7-496, and SNE1 7-555 tagged primers were not 
extended and their retention times on the HPLC was 2.5 minutes respectively, 

EXAMPLE 18 

[0543] In this Example (18), all reactions were conducted in foil-covered flasks. The sequence of reactions A^F 
described in this Example is illustrated in Figures 19Aand 19B. Compound numbers as set forth in this Example refer 
to the compounds of the same number in Figures 19A and 198. 

A. To a solution of ANP linker (compound 1, 11.2 mmol) and dlisopropylethylamine (22.4 mmol) in CHCI3 (60 ml) 
was added allyl bromide (22.4 mmol). The reaction mixture was refluxed for 3 hours, stirred at room temperature 
for 18 hours, diluted with CHCI3 (200 ml), and washed with 1.0 M HCI (2 x 150 ml) and H2O (2 x 150 ml). The 
organic extracts were dried (MgS04) and the solvent evaporated to give compound 2 as a yellow solid. 

To a mixture of compound 2 in CH2CI2 (70 ml), tris (2-aminoethyl) amine (50 ml) was added and the reaction 
rriixture stin-ed at room temperature for 18 hours. The reaction was diluted with CH2CI2 (150 ml) and washed with 
pH 6.0 phosphate buffer (2 x 150 ml). The organic extracts were dried (MgS04) and the solvent evaporated. The 
residue was subjected to column chromatography (hexane/EtOAc) to give 1 .63 g (58%) of compound 3: ^H NMR 
(DMSO-dfi): 5 7.85 (dd, 2H), 7.70 (t, 1H), 7,43 (t. 1H). 5,85 (m, 1H). 5,20 (q, 2H). 4.58 (q. 1H). 4.50 (d, 2H). 2.70 
(m, 2H). 2.20 (br s, 2H). 

B. To a solution of Boc-5-aminopentanoic acid (1 .09 mmol) and NMM (3.27 mmol) in dry DMF (6 ml), was added 
HATU (1.14 mmol) and the reaction mixture stirred at room temperature for 0.5 hours. A solution of compound 3 
(1.20 mmol) in dry DMF (1 ml) was added and the reaction mixture stirred at room temperature for 18 hours. The 
reaction was diluted with EtOAc (50 ml) and washed with 1,0 M HCI (2 x 50 ml) and D.I. H2O (2 x 50 ml). The 
organic extracts were dried (MgS04) and evaporated to dryness. The residue was subjected to column chroma- 
tography to give 420 mg (91%) of compound 4: ^H NMR (DMSO-dg): 6 8.65 (d, 1H), 7.88 (d. 1H). 7.65 (m, 2H), 
7,48 (t. 1H). 6.73 (br s. 1H). 5.85 (m. 1H). 5.55 (m. 1H). 5.23 (q. 2H), 4.55 (d. 2H). 2.80 (m, 2H), 2.05 (t. 2H). 1.33 
(s. 9H). 1.20-1 .30 (m. 4H), 

C. A solution of compound 4 (0.9 mmol) in HCI'1 ,4-dioxane (20 mmol) was stirred at room temperature for 2 hours. 
The reaction mixture was concentrated, dissolved in MeOH and toluene, and concentrated again (5 x 5ml) to give 
398 mg (quantitative) of the compound 5: ^H NMR (DMSO-dg): 6 8.75 (d. 1H). 7.88 (d. 1H). 7.65 (m. 2H), 7.51 (t. 
1H). 7.22 (m. 2H),5.85 (m, 1H), 5,57 (m. 1H). 5.23 (q. 2H). 4.55 (d. 2H). 2.80 (m. 2H). 2.71 (m. 2H). 2.07 (s, 2H), 
1,40-1.48 (br s. 4 H). 

D. To a solution of compound 21 (0.48 mmol, prepared according to Example 20) and NMM (1.44 mmol) in dry 
DMF (3 ml), was added HATU (0.50 mmol) and the reaction mixture stirred at room temperature for 0.5 hours. A 
solution of compound 5 (0.51 mmol) in dry DMF (3 ml) was added and the reaction stinred at room temperature 
for 18 hours. The reaction mixture was diluted with EtOAc (75 ml) and washed with 5% Na2C03 (3 x 50 ml). The 
organic extracts were dried (MgS04) and the solvent evaporated to give 281 mg (78 %) of compound 6: ^H NMR 
(DMSO-dg): 68.65 (d. 1H), 8.17 (d. 1H), 7.82-7.95 (m, 4H), 7.68 (m, 3H). 7.50 (t, 1H). 6.92 (d. 1H), 5.85 (m. 1H). 
5.57 (m, 1H). 5.20 (q. 2H). 4.55 (d. 2H). 4.30 (q. 1H), 4.05 (q. 2H). 2.95 (m. 4H). 2.80 (m, 2H), 2.72 (m. 2H). 2.05 
(s, 3H). 2.01 (t. 2H). 1.58-1.77 (m, 3H), 1,50 (m, 4H), 1.30 (q, 3H). 1.17-1.40 (m, 9H). 

E. To a mixture of compound 6 (0.36 mmol) in THF (4 ml), was added 1 M NaOH (1 mmol) and the reaction stirred 
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at room temperature for 2 hours. Ttie reaction mixture was acidified to pH 7.0 with 1 .0 M HCI (1 mi) and the solvent 
evaporated to give compound 7 (quantitative): NMR (DMSO-de): 5 8.65 (d, 1H). 8.17 (d. 1H), 7.82-7.95 (m. 
4H). 7.68 (m, 3H). 7.50 (t. 1H), 6.92 (d, 1H), 5.52 (m, 1H), 4.30 (q. 1H). 4.05 (q, 2H), 2.95 (m. 4H), 2.80 (m. 2H). 
2.72 (m. 2H). 2.05 (s. 3H). 2.01 (t. 2H). 1.58-1.77 (m. 3H), 1.50 (m. 4H). 1.30 (q. 3H). 1.17-1.40 (m, 9H). 
5 F. To a solution of compound 7 {0.04 mmol) and NMM (0.12 mmol) in dry DMF (0,4 ml), was added HATU (0.044 

, mmol) and the_reaction sjirred at^room temperature for 0,5 hours. Allylamine (0.12 mmol) was added and the 

reaction mixture stirred at room temperature for 5 hours. The reaction mixture~was diluted with EtOAc (15 ml) and 
washed with 5% NajCOg (3x10 ml). The organic extracts were dried (MgS04) and the solvent evaporated to yield 
15 mg (49%) of compound 8: NMR (DMSO-dg) 6 8.49 (d, 1H). 8.17 (d. 1H), 7.82-7.95 (m, 4H). 7.68 (m. 3H). 
10 7.50 (t, 1H). 6.92 (d. 1H). 5.72 (m, 1H). 5.50 (m, 1H). 5.03 (q, 2H). 4.37 (d. 2H), 4.30 (q, 1H). 4,05 (q. 2H), 2.95 

(m. 4H), 2.80 (m. 2H). 2.72 (m, 2H), 2.05 (s, 3H), 2.01 (t. 2H), 1,58-1.77 (m. 3H). 1.50 (m. 4H), 1.30 (q. 3H), 
1.17-1.40 (m. 9H). 

EXAMPLE 19 

15 

[0544] The sequence of reactions A->G as described in this Example 1 9 Is illustrated in Figures 20A and 20B. Com- 
pound numbers as set forth in this Example refer to the compounds of the same number in Figures 20A and 20B. 

A. To a solution of Fmoc-Lys(Boc)-OH (compound 9, 33.8 mmol) in CHCI3 (200 ml), was added diisopropylethyl- 
20 amine (67.5 mmol) and allyl bromide (67.5 mmol). The reaction mixture was reftuxed for 6 hours, stirred at room 

temperature for 16 hours, diluted with CHCts, washed with 1.0 M HCt (2 x 150 ml), saturated NaHC03 (1 x 150 
ml) and D.I. H2O (2 x 150 ml). The organic extracts were dried (MgS04) and the solvent evaporated to yield 
compound 10, 

To a solution of compound 10 in CHCI3 (90 ml), was added pyrrolidine (10 eq.) and the reaction was stirred 
25 at room temperature for 2.5 hours. The reaction mixture was diluted with CHCI3 (1 50 ml) and washed with saturated 

NaHC03 (3 X 250 ml). The organic extracts were dried (MgS04) and the solvent evaporated. The residue was 
subjected to column chromatography ( EtOAc / MeOH) to give 6.52 g (67%) of compound 11: NMR (CDCI3): 
5 5.90 (m. 1H). 5.27 (m, 2H), 4.60 (d. 2H), 3.48 (t, 1H), 3.10 (d, 2H). 1.40-1.78 (m. 9H).1.40 (s. 9H), 

B. To a solution of N-methylisonipecotic acid (1.60 mmol) and N-methyl morpholine (4.80 mmol) in dry DMF (5 
30 ml), was added HATU (1 .67 mmol). After 0.5 hours, a solution of compound 11 (1 .75 mmol) in dry DMF (2 ml) was 

added and the reaction mixture stirred at room temperature for 18 hours. The reaction mixture was diluted with 
CH2CL2 (60 ml) and washed with saturated Na2C03 (3 x40 ml). The organic extracts were dried (MgS04) and the 
solvent evaporated. The residue was subjected to column chromatography (CH2CI2 / MeOH / triethylamine) to 
give 580 mg (88%) of compound 12: NMR (DMSO): 5 8.12 (d, 1H). 6.77 (t, 1H). 5,90 (m, 1H), 5.27 (m, 2H), 
35 4.53 (d, 2H), 4.18 (m, 1H). 2,62-2.90 (m. 5H), 2.13 (s. 3H),1.85 (m. 2H).1.57 (m. 5H),1.35 (s, 9H), 1.00 (t, 2H). 

C. A mixture of compound 12 (1.39 mmol) in HChl. 4-dioxane (20 mmol) was stirred at room temperature for 4 
hours. The reaction mixture was concentrated, dissolved in MeOH, coevaporated with toluene (5 x 5ml) to give 
527 mg (quantitative) of compound 13: ^H NMR (DMSO-dg): 5 8.12 (d. 1H). 6.77 (t. IN). 5.90 (m, 1H), 5.27 (m. 
2H). 4.53 (d. 2H). 4.18 (m. 1H).2.65-3.00 (m. 8H), 2.23 (s, 3H).1.85 (m. 2H).1.57 (m. 5H), 1.00 (t, 2H). 

40 0. To a solution of 4-ethoxybenzoic acid (1 eq.) in dry DMF, is added NMM (3 eq.) and HATU (1 .05 eq.). After 0.5 

hours, a solution of compound 13 in dry DMF is added. After the completion of the reaction and basic workup, the 
compound 14 is isolated and purified. 

E. To a solution of compound 14 in THF, is added IN NaOH and the reaction mixture stirred at room temperature. 
After the completion of the reaction and acidification, the compound 15 is isolated. 
45 R To a solution of compound 15 (1 eq.) in dry DMF, is added NMM (3 eq.) and HATU (1.05 eq.). After 0.5 hours. 

a solution of compound 21 (ANP - allyl ester, prepared according to Example 20) in dry DMF is added. After the 
completion of the reaction and basic workup, the title compound 16 is isolated and purified. 
G. To a solution of compound 16 in THF, is added IN NaOH and the reaction mixture stirred at room temperature. 
After the completion of the reaction and acidification, the compound 17 is isolated. 

50 

EXAMPLE 20 

[0545] The sequence of reaction A through D as described in this Example 20 is illustrated in Figure 21 . Compound 
numbers as set forth in this Example, as well as Examples 18 and 19, refer to the compounds of the same number in 
55 Figure 21. 

A. To a solution of 4-ethoxybenzoic acid (7.82 mmol) and N-methyl morpholine (20.4 mmol) in CHjClj (10 ml), 
was added HATU (7.14 mmol). After 0.25 hours, a solution of compound 1 1 (6.8 mmol) in CH2CI2 (6 ml) was added 
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and the reaction mixture stirred at room temperature for 18 hours. The reaction was diluted with CH2CI2 (150 ml) 

and washed with 1 .0 M HCI (3 x 50 ml) and saturated NaHCOs (3 x 50 ml). The organic extracts were dried (MgS04) 
and the solvent evaporated. The residue was subjected to column chromatography (CH2Cl2/MeOH) to give 2.42 
g (82%) of compound 18: NMR (CDCI3): 5 7.78 (d, 2H), 6.91 (d. 2H). 6.88 (d. 1H), 5.83-5.98 (m, IH). 5.21-5.38 
5 (m. 2H), 4.80 (q. 1H), 4.66 (d, 2H). 4.06 (q. 2H), 3.11 (q, 2H). 1.90-2.04 (m. 1H). 1,68-1.87 (m. IH), 1.39 (I. 3H). 

1.34 (s,9H). 1.32-1.58 {m^4H). 

B. A mixture of compound 18 (5.5~^mmol) Tn HCI ,~4-dioxane"(14.3 mmol) was stirred at room temperature for 1 - 
hour. The reaction mixture was concentrated, dissolved in MeOH, azeotroped with toluene, and concentrated again 
(5x5 ml) to give a quantitative yield of compound 19. 

10 C. To a solution of N-methylisonipecotic acid (6.21 mmol) in dry DMF (15 mL), was added NMM (21 .6 mmol) and 

HATU (5.67 mmol). After 0.5 hours, a solution of compound 19 (5.4 mmol) in dry DMF (1 0 ml) was added and the 
reaction stirred at room temperature for 18 hours. The reaction mixture was brought to pH 12 with IN NaOH (20 
ml) and extracted with CHCI3 (2 x 200 ml). The organic extracts were dried (MgS04) and the solvent evaporated 
to give 2.2 g (89%) of compound 20: NMR (DMSO-dg): 6 8.52 (d, 1H), 7.84 (d. 2H), 7.72 (t. I H). 6.95 (d, 2H). 

15 5.80-5.95 (m. IH). 5.18-5.31 (dd. 2H). 4.58 (d, 2H). 4.37 (q, 1H). 4.08 (q, 2H). 3.01 (d. 2H), 2.08 (s. 3H). 1.95 (m. 

IH). 1.63-1.82 (m. 4H). 1.51 (m. 4H). 1.32 (t, 3H). 1.22-1.41 (m, 6H). 

D. To a solution of compound 20 (4.4 mmol) in THF ( 1 0 ml), is added 1 N NaOH (4.4 mmol) and the reaction mixture 
stirred at room temperature for 1 hour. The reaction was concentrated, dissolved in THF/toluene (2x5 ml), con- 
centrated, dissolved In CH2Cl2/toluene (1x5 ml) and concentrated again to give a quantitative yield of compound 
20 21: 1H NMR (DMSO-dg): 6 7.76 (d, 2H), 6,96 (d. 2H), 4.04 (q. 2H). 3.97 (d. IH), 2.97 (d, 2H). 2.64 (d. 2H), 2.08 

(s. 3H), 1,95 (m. 1H). 1.58-1.79 (m. 4H). 1.44 (m. 6H). 1.30 (t. 3H). 1.11-1.35 (m, 4H). 

EXAMPLE 21 

25 [0546] The synthesis of the CMSTs (Cleavable, Mass Spectrometry-detectable Tags) may be based on a combina- 
torial approach as described in Figure 22. The general approach is designed to be compatible with developments of 
the mass spectrometers and changes and improvements in ionization technologies. A central scaffold is first tested 
for compatibility with the type of ionization that is to be employed in the method. It is important that the scaffold not be 
succeptable to fragmentation, heat degradation, or dimerization to adduct formation. With the current APCI/quadrapole 

30 mass spectrometers, about 400 tags will fill the spectrum taking into account isotopic contamination which forces a 
minimal spacing of about 4 AMU. 

[0547] The CMSTs were designed in a modular fashion such that the tags can be built with a combinatorial chemistry 
approach. There are 5 "modules" (functionally separate atomic groups), in the complete tagged-oligonucleotide. The 
first module is the oligonucleotide (ODN). which can be of any length and sequence and which possesses a 5'-hexy- 

35 lamine for coupling to the linker, which is preferably a photodeavable linker. The ODN can serve either as a probe for 
hybridization or a primer in an extension, ligation, or er\zymatic-based assay. The 2nd module is the linker, preferably 
a photo-labile linker, that connects the tag to the oligonucleotide probe or primer. In the approach described in detail 
here, the photolabiie linker Is a o-nitrobenzyl amino add derivitive (see Lebi M. et al. "Dynamic database of references 
in molecular diversity" having internet address http://www.5z.com. for a compendium). The 3rd module Is the ionization 

40 enhancer. This module is the scaffold upon which the CMST is synthesized, and in general provides functionally that 
is specific for the type of ionization method used (i.e., APCI (positive or negative mode), electrospray, MALDI. etc). 
The 4th module is the gross mass adjuster upon which the mass can be altered in large increments of 200-500 amu. 
This module allows the re-use of the variable weight adjusters. The fifth module is the variable weight adjuster, also 
referred to variable mass units (VMU). Variable weight adjusters are chemical sidearms which are added to the tag 

45 scaffold. These variable weight adjusters fine-tune the weight of the CMSTs, The weight of the tags is spaced at least 
every 4 amu to avoid overiapping spectra due to isotopic contamination. The same VMU sidearms may be repeatedly 
used with the different types of ionization scaffolds. In summary, the ionization module and variable mass adjusters 
are designed to confer predictable behaviour in the MSD, the photodeavage of the photolabiie linker is fast (described 
below), the CMST/oligonudeotide conjugates are compatible with with PCR and HPLC and other manipulations found 

50 in assay fonnats. 

[0548] A detailed synthetic route of the tags is described below and in Figure 22. The synthetic route towards a 
CMST begins with the esterification of the photosensitive ANP linker (3-amino-3-(2-nitrophenyl) propionic acid, (1) to 
give the ethyl ester hydrochloride (2) in 84% yield. An important step in the process is the enzymatic transformation 
of (2) to provide the ethyl ester as a single isomer. After the ethyl ester hydrochloride has been basified to the free 
55 amine and concentrated, the oily residue was brought up in pH 7 phosphate buffer and adjusted to neutral pH with 2N 
HCI. The Amano PS esterase enzyme was added as a phosphate buffer slurry. After the completion of the reaction, a 
basic workup removed the hydrolyzed ANP byproduct (4), and the single-isomer ethyl ester (3, >99% e.e.) was recov- 
ered (92% of available material). 
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[0549] Coupling of (3) with a-BOC-e-alloc-lysine (5), using EDAC and HOBT, gave the protected ANP lysine (6) in 
91% yield. Removal of BOC with TFA provided the amino-e-alloc-lysine ANP ester (7) as a white solid. Methyl isonipe- 
cotic acid hydrochloride was coupled to 7, using EDAC and triethylamine, to give crude allocprotected core structure 
(8), which was deprotected with diethylamine, triphenylphosphine, and palladium acetate, at 50 degrees C. The re- 

5 suiting core structure (9) was crystallized from the reaction mixture and recovered by filtration in 95% yield as a yellow 

- solid.. _ _ ^ 

[0550] A variety of carboxylic acids, designated variable mass units (VMU), were coupled to (9), using HATU and"N~ 
methyl morpholine, A set of VMU's were designed to provide even-mass tags and 4 a.m.u. spacing, to avoid isotopic 
contamination. The following were used as exclusion criteria in selecting particular VMU's at the target masses: 1) 

10 functionalities incompatible with the synthetic sequence (e.g. esters); 2) elements with multiple isotopes (e.g.. C\, Br, 
S): 3) functionalities that might lead to compeing photoprocesses (iodides, acyl- and arylphenones); 4) racemic acids; 
and 5) availablity from vendors. 

[0551] After purification by column chromatography, the CMST ethyl ester (10) was recovered in variable yields. 
Base hydrolysis of (10) with NaOH gave the CMST acid (11) in quantitative yield. The final step, the fonmation of the 
^5 activated ester, was achieved using tetrafluorophenol trifluoroacetate and Hunnich's base, and resulted in the CMST 
TFP ester (12) in variable yields. 

[0552] It is convenient to attach the tags (in general) to the 5'-end of oligonudeotides so the 3* hydroxyl can be 
extended in the polymerase chain reaction or be available for other enzymatic modifications. Also, when used directly 
as probes as described in the application here, the tags are preferably attached to the 5'-end of the oligonucleotide 
20 probes. The guidelines described by Lukhtanov et. al., "Oligodeoxyribonucleotides with conjugated dihydropyn-oloin- 
dole oligopeptides: preparation and hybridization properties", Bioconjug Chem 6f4j:4 18-26. Jul-Aug. 1995. may be 
followed to prepared the tagged oligonucleotides from CMSTs and oligonucleotides. 

EXAMPLE 22 

25 

[0553] A PHREO photodeavage unit is placed between the HPLC and the mass spectrometer or an autoinjector and 
the MSD. "PHRED" which stands for Photochemical Reactor for Enhanced Detection, and Is available from Aura In- 
dustries, Staten Island, NY (available with both a 254 nm and 366 nm bulb, the 254 nm bulb was used). An inline device 
is preferably placed between the separation instrumentation (e.g., HPLC or gel) and the detector. The interface pref- 

30 erably has the following properties: the ability to collect the DMA fragments at discreet time intervals, concentrate the 
DNA fragments, remove the DNA fragments from the electrophoresis buffers and milieu, deave the MW-tdentifier from 
the DNA fragment, separate the MW-identifier from the Df^ fragment, dispose of the DNA fragment, place the tag in 
a volatile solution, volatilize and ionize the tag which Introduces the tag Into mass spectrometer. 
[0554] A suitable configuration of the photodeavage device is a 300 cm long, 8 watt, UV gemnicidal lamp (G8T50) 

35 (with an emmision around 366 nM) underwhich a 80 1 coil of 0.01 inch ID tefzel tubing is placed. Flow rates of 800 1 
per minute are generally suitable. Solution compositions which are compatible with APCl/MS and which contain only 
low concentrations of acetronltrile and buffers such as Tris-HCl are preferred. With this configuration, there is no re- 
quirement to separate the DNA firom the tag prior to the ionization step. The photolabile linker cleaves very rapidly 
under these conditions. The heat source in the APCI chamber contributes to the cleavage of the photolabile linker. 

40 [0555] By varying the length of the tefzel tubing coil under the UV source and holding the rate of flow constant, the 
residence time uiider the UV source was varied from the 0.75 to 6 seconds. The response factor (the integrated ion 
current produced per mole tag injected into the flow stream in "flow injection analysis" (Ft A)) of the tags was determined 
for a pool of 6 tags all tethered to a single oligonudeotide sequence (a 20-mer, M1 3 forward sequendng primer). The 
response factor is the integral of the efficiency of ionization, ion introduction into the vacuum chamber, and then sub- 

45 sequent detection by the MSD. Each tag in the pool was present at a concentration of 100 fmol per ul. The diluent was 
tRNA (Boehringer Mannheim) at a concentration of 1 ug/ml in HPLC-grade water. In TABLE 3, the response factor and 
the percentage of the signal observed relative to the longest exposure (6 seconds) are listed on a per tag basis. The 
results indicate that the tag Is rapidly deaved (less than 2 seconds) from the tethered oligonudeotide. In the time frame 
of 1 .9 to 6 seconds there was little decrease in the observed response factor with 6 different tagged-ODNs. At the 

50 shortest exposure time tested (0.75 seconds) there was up to a 25% decrease in the observed response factor. 



TABLE 3 



55 



Tag MW 


10 ul 


25 ul 


50 ul 


80 ul 


447 


4300 81% 


4400 83% 


5300 100% 


5300 100% 


455 


4900 92% 


5200 98% 


6400 120% 


5300 100% 


479 


3600 75% 


4100 85% 


4900102% 


4800 100% 
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TABLES (continued) 



Tag MW 


10 ul 


25 ul 


50 ul 


80 ul 


503 


4600 74% 


5700 92% 


6700 108% 


6200 100% 


507 


5200 85% 


5400 88% 


6500 104% 


6100 100% 


"'sir ~ 


4f00'78% 


"510098% 


5500105%- 





[0556] There was an approximate 40% decrease in the response factor of the six tags when the lamp In the photo- 
cleavage unit (data not shown). It has been detemiined that the photolabile linker is thermolabile and the tag is appar- 
ently cleaved during the APC ionization step where the vaporization temperature is 450PC. 

EXAMPLE 23 

COLLECTIVE TAG BEHAVIOR AND RELATIVE STABILITY 

[0557] Tags were designed to provide a single parental ion that did not fragment into daughter ions, or form adducts. 
The response of the tags was determined for a pool of 43 tags all tethered to a single oligonucleotide sequence (20, 
Ml 3 sequence). Each tag In the pool was present at a concentration of 100 fmol per ^il (the concentration was deter- 
mined by applying a dilution factor to a oiigo/tag stock that was measured spectrometricaliy at 260 nM. A conrection 
for the tag contibutlon to absorbance at 260 nM was not used. The diluent was tRNA (Boehringer Mannheim) at a 
concentration of 1 ^ig/ml in HPLC-grade water. The pool was stored at 4°C in the dark. CMST-tagged oligonucleotides 
may be handled under normal laboratory lighting conditions without occurrence appreciable degradation. 
[0558] Prior to analysis, a 55 jiJ aliquot was removed from the stock solution and placed in a 200 jiJ polypropylene 
autosampler vial and crimped closed. Five injections of 5 (il each from the pool and 3 injections of 5 ^1 each from the 
diluent was prefomied by the HPLC HP1 1 00 ALS. The APCI-MS chamber parameters were as follows: 20 PSI for the 
nebulizer pressure, vaporizer temperature was 450°C, drying gas flow was 3 L/min, drying gas temperature was 350*^0, 
corona cun*ent was 4 uA. fragmentor voltage 1 25 V, the gain was set to 1 , and the peak width 0.07 minutes. The flow 
rate was 0.8 ml/min., the "dead space" of the photocleavage unit was 80 jil, (0.01" ID Tefzel). The lamp of the photo- 
deavage unit operated at 366 nM. Each tag was quantified by extraction of the SIM ion from the TIC. Peaks were 
integrated under the following parameters: slope sensitivity at 2500, minumum peak area at 800, minimum peak height 
at 100, peak width at 0.15. shoulders settings at "off. Peak area for each tag was recorded for all five injections within 
an experiment and the average area was calculated along with the standard deviation and coefficient of variation. 
Average areas, standard deviations, and CVs were also calculated for a single day, over a two day period and for a 
three day period. Within a single day the coefRcient of variation varied between 2.0% and 9.9% between different tags. 
Over a three day penod the coefficient of variatton varied between 4.0% and 9.8% between different tags. The tags 
are therefore stable with respect to storage and pooling, as shown in TABLES 4a. 4b and 4c. 



TABLE 4a 



1-DAY 


Tag MW 


CV @ 10 fmol/injection 


CV @ lOOfmol/injectlon 


447 


2.0 


3.7 


455 


4.0 


6.5 


479 


6.9 


7.9 


503 


9.9 


2.9 


507 


4.1 


1.8 


511 


4.1 


4.0 , 
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TABLE 4b 



5 



10 



15 



20 



2-DAY 


Tag MW 


CV@ lOfmol/injection 


CV @ lOOfrnol/injection 


447 


3.0 


4.1 


455 


3.8 


8.6 


479 


4.8 


6.7 


503 


1.2 


5.6 


507 


5.5 


8.7 


511 


4.5 


5.6 


TABLE 4c 


3-DAY 


Tag MW 


CV@ lOfmol/injectlon 


CV @ lOOfmol/injection 


447 


3.3 


4.9 


455 


4.0 


3.9 


479 


5.6 


6.7 


503 


9.8 


8.9 • 


507 


2.0 


4.8 


511 


3.8 


6.7 



30 Sensitivity and Lower Limit of Detection 

[0559] Current mass produced quadrapoles have sensitivities comparable to fluorescent based sequencing. The 
sensitivity can be expressed in terms of the lower level of detection of the mass of the CMST We are defining the lower 
limit of detection as 3 standard deviations above the background of the assay system. The lower limit of detection of 
35 the pool of 43 conjugates was determined for the 30-ion SIM mode. 

[0560] A set of 10, two-fold dilutions were prepared in the tRNA/water diluent to give 500, 250, 125, 62.5, 31 .2, 15.6. 
7.8, 3.9, 1.9 and 0,9 femptomole of material per injection (5-10 ^1), The data was obtained in the 30-ion SIM mode 
under conditions described above. The LLD for each tag is shown in TABLE 5. 

40 TABLE 5 



45 



THE LOWER LIMIT OF DETECTION FOR 43 CMSTS. 


TagMW 


LLD 


Signal at LLD 


367 


30 


1970 


371 


30 


1707 


375 


15 


1318 


379 


15 


1642 


383 


30 


2585 


387 


15 


1301 


391 


15 


1554 


395 


15 


1960 


403 


8 


1784 


407 


15 


2266 
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TABLES (continued) 





THE LOWER LIMIT OF DETECTION FOR 43 CMSTS. 




Tag MW 


LLD 


Signal at LLD 




411 


15 


2285 




- - 415 


- 8 


1362 




419 


8 


1260 


10 


423 


8 


1530 




427 


15 


1607 




431 


8 


1606 


1^ 
lO 


439 


8 


2797 


443 


4 


1576 




447 


4 


1144 




459 


8 


1881 


20 


463 


8 


1179 




467 


8 


1625 




471 


2 


1094 


25 


475 


4 


1664 


479 


2 


2626 




483 


8 


1528 




487 


8 


1556 


30 


495 


15 


1894 




499 


15 


1465 




503 


4 


1208 


35 


507 


8 


1843 


511 


8 


1453 




515 


8 


1276 




519 


8 


1042 


40 


523 


15 


1690 




527 


8 


815 




531 


8 


1521 


45 


535 


8 


1276 


539 


4 


1651 




543 


15 


1076 



[0561] The LLD for this particular set of tags is approximately 4-30 femtomoles per injection following the photo- 
cleavage step from the oligonucleotide: (30 x 10-15 moles @ 500 MW —> 5 x 10-11 gram — > 50 picograms of tag. 
100 X 10-15 mole at 330 x 400 nt x 2 = 3 x 10-8 gram = 30 nanograms of a 400 nt double strand PGR product). 
Therefore, assuming an average 25 ^1 PGR reaction contains 300 ng of a double strand PGR product, about 1/10th 
of that product will be used to generate a signal that Is statistically above the background of the measurement. For a 
means of comparision, approximately 100-200 ng of DNA per lane on the ABI 377 sequencer was used. 
[0562] The lower limit of detection of the tagging system effects the level of multiplexing that can be achieved in a 
single injection. To date, we have not noted any problem with the measurement process when up to 10 |ig of PGR 
product is used per injection. The implication of this number is that 10,000/30 or about 300 reactions can be multiplexed 
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using the tagging system of the present invention. Therefore, for the HP-MSD1100, the level of multiplexing is about 
the same as the tags that can be placed in the spectrum of the quadrapole MS. 

Tag Interference 

5 

[0563] There is little or no intereference in terms of response (response being the measureable ion current that 
reflects the extent of ioni^tiohrdegreenof fragmehtatibn~orliegradationradd 

numbers of CMSTs of the present invention. The multiplexing of tags does not affect the response of individual tags 
and therefore multiplexing does not affect ionization or total ion cun-ent. 

10 

Response factor as a function of injection volume 

[0564] As described above, a pool of 43 conjugates were measured to determine the response factor as a function 
of injection volume. 50 fmol per injection and 500 fmol per injection were measured in 5, 10, 20, 50 and lOOfil volumes. 
15 The APCI-MS parameters were identical to those used in the tag stability experiment. The values shown in TABLE 4 
are an average of 5 replicate injections for each volume measured. There was no decrease in the RF between 5 ^1 
and 10 nl volumes. At the 20 jil Injection volumes the RF was 90-97% of the 5 volume. When 50 \i\ volumes were 
measured, the RF was 54%-75% of the 5 \i\ volume. There was no detectable signal using the 1 00 )xl Injection volumes. 

20 EXAMPLE 24 

P4502D6 POLYMORPHISM DETECTION 

[0565] The CMST technology platform described herein may be used to measure the CYP2D6 polymorphisms which 

25 are responsible for the metabolism of debrisoquine 4-hydroxylase. This particular P450 cytochrome is important in the 
metabolism of more than 30 drugs and xenobiotic compounds. CYP2D6 (P450-2D6) is estimated to be responsible 
for metabolizing approximately 25% of the cun-ently prescribed drugs. The CYP2D6 is also known as debrisoquine/ 
sparteine hydroxylase. Sache. Am. J. Human Genetics, 60:284-295. 1 997, has estimated that up to 1 0% of Caucasians 
are poor metabollzers due to either Inactivating mutations in both ailels or complete lack of the gene. There Is also a 

30 braod range of CYP2D6 activities in addition to poor metabollzers. The mutations selected for study Include C188T. 
G212A. delT1795. G1846T/A, G1934A, delA2637, C2938T. and G4268C. The RFLPS were detected by gel electro- 
phoresis as previously desaibed (see Gough et. al., Nature 347:773-776. 1990). Primers used for RFLP corresponds 
to those used by Sachse et al. Am. J. Hum. Genet. 60:284-295, 1997. Primers used for sequencing coa^sponds to 
those used by Meyer et al. Pharmacogenetics 5:373-384, 1995. 

35 [0566] The principle of the CMST-based assay was to Immobilize one strand of the amplified CYP2D6 exon on a 
solid phase (e.g., magnetic particles), hybridize the oligonucleotide probes, wash away unhybridized material, elute 
the hybridized probe and then detect the mass spec tag by mass spectrometry (after cleaving the tag from the probe). 
[0567] The amplification conditions were as follows. Primers flanking the 2D6 gene (Sachse et, al., Am. J. Hum. 
Genet. 60:284-295. 1997) were used to amplify a 4.681 b.p. genomic DNA fragment containing all of the relevant gene 

40 sequence. The PCR reaction was composed of IX Expand HF buffer, 1.5mM MgCI2. 200jiM dNTP's, 0.5^iM primers 
P100 & P200. 0,5% formamide, lOOng gDNA. and I.IU Expand^ High Fidelity enzyme mix (Boehrlnger Mannheim). 
Thermocycling conditions were as follows: 94°C for 3 minutes; 1 0 cycles of 94''C for 30 seconds, 62°C for 30 seconds, 
and 68°C for 4 minutes; 20 cycles of 94*C for 30 seconds, 62°C for 30 seconds, and SS'^C for 4 minutes +20 seconds/ 
cycle; 68'C for 10 minutes. Product were visualized on a 1.0 % agarose gel stained with ethidium bromkJe. 

45 [0568] The assay fonmat Is described as follows. Streptavldin magnetic particles (Promega Magnesphere. binding 
capacity of 80 pmol biotin/1 00 jig particles) were washed with low salt wash and binding buffer (LSWBB, 1 00 mM NaCI, 
I mM EDTA, 10 mM Tris, pH 7.5) and then resuspended in high salt wash and binding buffer (HSWBB. 2 M NaCI. 1 
mM EDTA, and 10 mM Tris pH 7.5) at a concentration of 2000 ^ig/ml. The biotinylated PCR products were Incubated 
with the streptavldin particles for 2 hours at 21''C with constant rotary mixing. The particles were washed twice with 

so 200 pi of HSWBB and once with 200 ^1 of LSWBB. The bound PCR amplicons were then denatured by treatment with 
50 p-l of 0.1 N NaOH for 10 minutes at 21**C. The particles were then washed once with 50 ^1 of 0.1 N NaOH and three 
times with 200 \i\ LSWBB. The particle-bound amplicons were then hybridized with equal molar mixtures of wild-type 
(wt) and mutant (mt) probes possessing different mass tags. Fifty picomoles of respective probe was placed in 200 pi 
of 2 m GuSCN. 5 mM EDTA and 1 0 mM Tris pH 7.5 and 50 pi of the hybridization solution was placed with the particles. 

55 Hybridization was for 1 hour at 21 ''C with constant rotary mixing. The particles were washed 5 times with LSWBB and 
the tubes were changed after the second wash. The hybridized probes were eluted from the particles by treatment of 
the particles with 20 of 0.1 N NaOH and a following wash of 9 jil of 0.1 N NaOH. The solution was then neutralized 
with 3 pi of 1 M acetic acid. Five pt of this solution was then injected Into the mass spectrometer (HP 1100 series LC/ 
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MS equipped with a vacuum degasser, binary pump, autosampler and diode array detector). The mass spectrometer 
was used with the APCl source option. HP LC/MSD Chemstatlon software was used for system control, data acqulstlon 
and data analysis installed on a HP vectra XA with the Windows NT woricstation version 4.0 operating system). The 
flow steam into the MS consists of 50% acetonitrile in ultra-pure water at a flow rate of 800 [il/minute. The photochemical 
5 cleavage device consisted of 254 nm low pressure mercury lamp, a UV transparent reactor coil and a lamp holder 
(Aura Industries). 

[0569] Representative-results are shown in"TABLE6r — _ 



TABLE 6 



10 



Individual 


Exon 


mAU (wt) 


mAU (mt) 


CMST call 


RFLP call 


1362 PF13 


4 


0 


190,000 


M/M 


M/M 


1362 PM14 


4 


152,000 


0 


W/W 


W/W 


1362 MF15 


4 


149.000 


53.000 


W/M 


W/M 


1377 C1 19 


6 


0 


271,000 


M/M 


M/M 


1377 C2 20 


6 


104,000 


88,000 


W/M 


W/M 


1377 C3 21 


6 


290,000 


0 


W/W 


W/W 


1377 CI 19 


9 


0 


74,000 


M/M 


M/M 


1377 C2 20 


9 


38.000 


41.000 


W/M 


W/M 


1377 C3 21 


9 


149,000 


0 


W/W 


W/W 


CONTROL 




0 


0 


NONE 


NONE 



EXAMPLE 25 

25 

GENE EXPRESSION MONITORING WITH CMST-TAGGED ODNS 

[0570] Total RNA (1-2ug) from the A549 human cell line was transcribed using Superscript II reverse transcriptase 
and oligodT-primer in a final volume of 22^1 according to manufacturer's instructions (Life Technologies; Gaithersburg, 

30 MD). A 75bp region of the apoptosis-related human DAD-I gene coding region spanning an intro-^xon boundary was 
amplified by Taq polymerase chain reaction (PCR); an initial denaturation of 95''C (5 min) was followed by 25-60 cycles 
(annealing at 45°C for 1 5 sec and denaturation at 95°C for 5 sec). The PCR reaction (20-200^1) contained 0.5fiM DAD- 
1 reverse primer (5*-biotin-CCA GGA AAT TCA fiJKG AGT GA-3'), 0.0125jiM or 0.5nM DAD-1 phosphorylated forward 
primer (5'-TTG GCT GAA TCA TTC TCA TT-3'). 7 X 102-1.2 X 10^ molecules of internal standard (5'-CC AGG AAA 

35 TTC AAA GAG TGA ACA TTC TTT TTG TGT CG-3'), Ijil A549 cDNA or 8 X 10^- X 10^1 molecules of WT mimic 
(5'-CCA GGA AAT TCA AAG AGT GAA CAT TCT TTT AGT CTC CTA CTC CTC AAT TAA GTA AAT GAG AAT GAT 
TCA GCC AA-3'). 0.8U Taq polymerase. 0.2mM dATP. 0.2mM dCTP, 0,2mM dGTP, 0.2mM dTTP, 1 .5mM MgClz, 50mM 
Kcl, and 1 0mM Tris-HCI pH 8.3. 

[0571] The amplification product (50nJ) was rendered single-stranded by either assymetric amplification conditions 

40 (above) or by digestion with 2.5U Lambda exonuclease (Boehringher Mannheim; Indianapolis. IN) for 15 min (37°C) 
in 18mM Tris-HCI pH 9.5, 1.8mM MgCl2, 28mM Kcl. The digested amplicon or assymetric PCR reaction was placed 
at 70°C for 5 min, cooled briefly (room temperature), and adjusted to 2,1 M guanldine isothiocyanate. 50mM Tris-HCI 
pH 7.5. 0.1% saricosyl, 1^ig/ml tRNA, 5^g/ml WT 394 probe (5*-394 MW CMST tag-TTG AGG AGT AGG AGA CTA 
AAA-3'), 5fig/ml IS 390 probe (5'-390 MW CMST tag-TTGACGACTACGACACAAAAA-3'). 

45 [0572] The hybridization reaction was incubated 10 min (RT) and transferred to a 30K MWCO spin filter (Millipore 
Corp.; Bedford, MA) containing 7jil of 286[ig/ml Avidin-DN (Vector; Burlingame, CA) and 0.7ng/ml tRNA. The spin filter 
was incubated 5-10min (RT) and centrifuged (4,000xg, lOmin). The spin filter was washed twice with four-hundred 
microliters of cold HPLC-grade dH20 and centrifuged (4,000xg, 10min). Twenty-five microliters of l^ig/ml tRNA were 
added to the spin filter to elute the retained hybrid. The 25[i\ retentate was injected in 50% acetonitrie through a pho- 

50 tolysis unit into an HP mass spectrometer (APCl positive mode). Single ion measurements were made for tags having 
molecular weights of 390 and 394, and the results displayed as peak areas. The ratio of the 390 / 394 tag signals was 
proportional to the amount of amplicon generated from either the intemal standard or the A549 DAD-1 cDNA or WT 
mimic. Calculation of the unknown number of RNA molecules in the A549 total RNA or number of WT mimics could 
be made from a standard curve as shown on the log/log plot of the number of input intemal standard molecules vs. 

55 the ratio of the 390 / 394 tag signals. The unknown is the value of x when y=0. Alternatively, the unknown = the ratio 
of 390 / 394 X the number of input intemal standard molecules, when the ratio is between 0.3 and 3. 
[0573] The results are shown in Figure 23. 
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EXAMPLE 26 

SINGLE NUCLEOTIDE EXTENSION ASSAY 

5 [0574] RNA preparation: Total RNA was isolated was prepared from Jurkat cells using (starting with 1 x 10^ cells in 
exponential growth) using an RNA isolation kit from Promega (Wi). RNA was stored in two aliquots: (1) stock aliquots 
in diethyl pyrocartx>nate-treated ddH20 was stored at--20**G, and (2) long term storage as a suspension in-1 00% H2O;- 
[0575] Reverse Transscription: Poly(dT) primed reverse transcription of total RNA was performed as described as 
described in Ausubel et al. (Ausubel et. al„ in Current Protocols in Molecular Biology. 1991 , Greene Publishing Asso- 

10 ciates/Wiley-lnterscience, NY, NY.) except that the reaction(s) were scaled to using 1 |ig of input total RNA. 20-50 units 
of reverse transcriptase (Promega) was diluted 10-fold in 10% glycerol, 10 mM KP04 pH 7.4, 0.2% Triton X-100. and 
2 mM DTT and placed on ice for 30 minutes prior to addition to the reactions. Gene-specific reverse transcription for 
GADPH and other control genes as described in the figures and tables were perfomied using 1 fig of total Juricat RNA 
reversed transcribed in 10 mM Tris-HCI pH 8.3, 50 mM KCI, M MgCL2, 1 mM dNTPs, 2 U/fil RNAsin (Gibco-BRL). 0.1 

15 oligomer and 0.1 25 U/fil of M-MLV reverse transcriptase (Gibco-BRL) in 20^J reactions. Reactions were incubated 

in at 42''C for 15 minutes, heat inactivated at 95°C for 5 minutes, and diluted to 100 |xl with a master mix of (10 mM 
Tris HCI pH.8.3. 1 mM NH4CI. 1 ,5 M MgCla. 1 00 mM KCI). 0.125 mM NTPs. 10 ng/ml of the respective oligonucleotide 
primers and 0.75 units of TAQ polymerase (Gibco-BRL) in preparation for PCR amplification. 
[0576] PCR: PCR for each gene was performed with gene specific primers spanning a known intron/exon boundry 

20 (see tables below). All PCRs were done in 20^1 volumes containing 10 mM Tris HCI pH 8.3, 1 mM NH4CI, 1 .5 M MgCl2. 
1 00 mM KCI), 0.1 25 mM NTPs, 10 ng/ml of the respective oligonucleotide primers and 0.75 units of TAQ polymerase 
(Gibco-BRL). Cycling parameters were 94''C preheating step for 5 minutes followed by 94^*0 denaturing step for 1 
minute, 55**C annealing step for 2 minutes, and a 72*C extension step for 30 seconds to 1 minute and a final extension 
at 72°C for 10 minutes. Amplifications were generally 30-45 in number. 

25 ' [0577] Purification of templates: PCR products were gel purified as described by Zhen and Swank (Zhen and Swank. 
1993, BioTectiniques. 14(6), p894-898.). PCR products were resolved on 1% agarose gels run in 0.04 M Tris-acetate, 
0.001 M EDTA (Ix TEA) buffer and stained with ethidium bromide while visualizing with a UV light source. A trough 
was cut just in front of the band of interest and filled with 50-200 \i\ of 10% PEG in Ix TAE buffer. Electrophoresis was 
continued until the band had completely entered the trough. The contents was then removed and extracted with phenol. 

30 cholorfonm extracted, and then precipitated in 0,1 volume of 7.5 M ammonium acetate and 2.5 volumes of 100% EtOH, 
Samples were washed with 75% EtOH and briedly dried at ambient temperature. Quantitation of yield was done by 
electrophoresis of a small aliquot on 1 % agarose gel in Ix TBE buffer with ethidium' bromide staining and comparision 
to a known standard. 

[0578] Each SNuPE reaction was earned out in a 50 ^1 volume containing about 1 00 ng of the amplified DN A fragment. 

35 1 ^M of the SNuPE primer, 2 units of Tag polymerase, and 1 of the appropriate dNTR All dNTPs are unlabelled in 
this type of assay. The buffer used was 10 mM Tris-HCI (pH 8.3), with 50 mM KCI, 5mM MgCl2, and 0.001% (wt/vol) 
gelatin. The samples were subjected to one cyde consisting of a 2-minute denaturation period at 95°C. a 2 minute 
annealing period at 60*^0 and a 2 minute primer extension period at 72*0. The sequence of the SNUPE primer for 
each family is described below. 

^ [0579] Primer extensions: Single nucleotide primer extensions were performed as described in Singer-Sam et. al., 
(Singer-Sam et al.. 1992, PCR Methods and Applications, 1, p160-163) except that 1 mM Mg++, 0.1 \lM primer, and 
0.05 jiM of each dNTP type was used in each reaction type. After each primer extension described above, one-fifth 
volume of a loading dye (80% formamide, 0.1 % bromophenol blue, 0.1 % xylene cyanol, 2 mM EDTA) was added, 
and the netire sample electrophoresed in 15% denaturing polyacrylamide gel. Gels were fixed in 10% glycerol. 10% 

45 methanol. 10% glacial acetic add with constant shaking followed by washing steps with 10% glycerol. The gels were 
then dried at 55 C for 3-5 hours. 

[0580] The primers described in this experiment are described by Rychlik (Rychlik, (1995) BioTechniques, 18, 
p84-90). Primers may be synthesized or obtained as gel-filtration grade primers from Midland Certified Reagent Comap- 
ny (Midland . Texas). The amplifications are either TAQ DNA polymerase-based (10 mM Tris-HCI pH 8.3, 1.5 mM 

50 MgCl2. 50 mM KCI) or Pfu DNA polymerase-based based (20 mM Tris-HCI pH 8.3. 2,0 mM MgClj, 1 0 mM KCI, 1 0 mM 
(NH4)2S04, 0.1% Triton X-100, 0.1 mg/ml bovine semm albumin). The total nucleoside triphosphate (NTPs) concen- 
tration in the reactions is 0.8 mM, the primer concentration is 200 nM (unless othenwise stated) and the template amount 
is 0.25 ng of bacteriophage lambda DNA per 20 \i\ reaction. Cyding parameters were 94°C preheating step for 5 
minutes followed by 94*^0 denaturing step for 1 minute. 55''C annealing step for 2 minutes, and a 72**C extension step 

55 for 30 seconds to 1 minute and a final extension at 72^C for 1 0 minutes. Amplifications were generally 30 -45 In number. 
[0581] Two regions in the bacteriophage lambda genome (GenBank Accession #J02459) were chosen as the priming 
sites for amplification. The 5'-primer has a stable GC-rich 3*-€nd: the 3' primer is chosen so that a 381 bp product will 
result. The 5* fonward primer Is H17: 5-GAACGAAAACCCCCCGC. The 3*-reverse primer is RP17: 
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JO 



15 



20 



25 



5-GATCGCCCCCAAAACACATA. 

[0582] The amplified product was then tested for the presence of a polymorphism at position 31245. The following 
primer was used in four single nucleotide extension assays; SNE17: 5*-GAACGAAAACCCCCCGC. The four single 
nucleotide extension assays were then carried as described above. All the reactions are then pooled and 5 ^1 of the 
pooled material was injected onto the HPLC column (SeraSep, San Jose, CA) without further purification. 
[0583] HPLC was carried out using automated HPLC instrumentation (Rainin. Emeryville, CA, or, Hewlett Packard, 
" Palo"Alto7 CA): Uhpurifie'd SNEA" products which had been denatured for 3 minutes at 95 prior into injection into an- 
HPLC were eluted with linear acetonitrile (ACN, J,T. Baker, N J) gradient of 1 .8%/minute at a flow rate of 0.9 ml/minute. 
The start and end points were adjusted according to the size of the SNEA product. The temperature required for the 
successful resolution of the SNEA molecules was 50**C. The effluent from the HPLC was then directed into a mass 
spectrometer (Hewlett Packard, Palo Alto, CA) for the detection of tags, with the results shown in TABLE 7. 

TABLE 7 



Tagged Primer 


ddNTP Type 


Retention lime 


Extended? 


SNET 7-487 


ddATP 


2.5 minutes 


no 


SNET7-496 


ddGTP 


2.5 minutes 


no 


SNET7-503 


ddCTP 


4.6 minutes 


yes 


SNET7-555 


ddTTP 


2.5 minutes 


no 



[0584] The results of TABLE 7 indicate that the mass spectrometer tag (CMST) was detected at a retention time of 
4.6 minutes indicating that the SNE17 primer was extended by one base (ddCTP) and therefore the polymorphism at 
position 31245 was in this case a "G". The SNE17-487. SNE1 7-496, and SNE17-555 tagged primers were not extended 
and their retention times on the HPLC was 2.5 minutes respectively 



EXAMPLE 27 



30 



PHORPHORAMIDITE CHEMISTRY FOR TAGGED MOLECULE SYNTHESIS 
Preparation of aminohexyi tailed tags 



35 



[0585] As shown in Figure 26, preparation of pure aminohexyltailed tag (8) was achieved utilising the TFP method- 
ology. The TFP ester was prepared by reaction of the lithium salt of tag 1 66 (9) with excess TFP-TFA and Hunigs base 
in OMR After wori<up a dichloromethane solution TFP ester was treated with 6-aminohexanol. A precipitate of (8) was 
formed immediately, and was isolated by filtration. 



40 



Preparation of tag phosphoramidites 

[0586] Reaction of the aminohexyi tailed tag (5) with chlorophosphoramidite gave the desired phosphoramidite (1 7) 
with no phosphonate detected by mass spec analysis. 



45 



i-Pr^-N-P^ ^ ^ 

0(CH2)6NHCOTagi66. 



/-Pr244-P, 



.OCHjCHzCN 
^OCDa 



50 



55 



(17) 



(18) 



[0587] Excess phosphytilating agent was quenched with deuterated methanol (H3-methanol is usually used^ but use 
of deuterated methanol in this case allows unambiguous identification of the origin of quench products) giving rise to 
(18). This material survived the subsequent aqueous woricup and as it is itself a phosphoramidite it has the potential 
to interfere in subsequent reactions. 
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Reaction of Phosphoramidite with Polymer Supported Thymidine 

[0588] The thymidine is linked to controlled pore glass beads which are contained within a plastic cartridge. The 

cartridges used contain 1000 nM (1x10"^ moles) of supported thymidine. The 5' hydroxyl of the base is protected as 
5 the dimethoxytrityl ether which is removed with 3% trichloroacetic acid in dichloromethane prior to reaction with the 

phosphoramidite (Figure 27). After reaction with the phosphoramidite the intennediate phosphite triester (19) is oxidised 
to the more stable phosphotriester (20). Concurrent removal of the cyanoethyl protecting group and cleavage from the- 

solid support is carried out with ammonia to give the tagged thymidine (21 ). 

[0589] Crude material from the phosphoramidite preparation was used in this sequence and the material cleaved 
10 from the support analysed by mass spectrometry. The major ions observed in positive mode along with assignments 
are shown in TABLE 8. 



TABLE 8 



Mass 


Assignment 




O o- 

0^ ^~^0(CH2)6NHCOTag,66 
Loss of thymine and water 


711 • 


0(CH2)6NHCOTagi66 
Loss of thymidine and phosphate 


419 


O ^ H 

H 0 
Photocleavage product. 



[0590] It appears that under the conditions used the phosphate ester linkage is prone to fragmentation. Analysis in 
negative mode showed a peak at 338 amu conesponding to the compound (22) this arises from the reaction of the 

<5 quench product (18) (Figure 28) which Is a major contaminant of the cnjde. 

[0591] The tag -phosphoramidite (17) could be separated from the non-polar contaminant (1 8) by silica gel chroma- 
tography (10% methanol / 89% dichloromethane /1% triethylamine) and 115mg of (17) was obtained. This material 
was used in the reaction cycle to give (21) detected as its cleavage product (5) in the mass spec (HPLC analysis of 
the crude showed that (5) was not present thus the peak at 711 amu in the mass spec must have arisen by cleavage 

50 of the desired product (21 ) in the instrument). Iso present was unreacted thymidine indicating that the reaction had not 
gone to cmpletion. The material was also passed through the UV cleavage flow system and the photocleavage product 
(23) detected by mass spectrometry. 
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20 [0592] Therefore, it is possible to prepare a mass spec tag phosphoramidite and to purify and react it with a polymer 
bound substrate which can be subsequently oxidised, deprotected and cleaved from the support to give a tagged base 

1. "Oligonucleotide Synthesis, a Practical Approach" ed M.J.Gait, IRL Press 1990. 

2. R.T Pop. TefraWno;? /eft, 1991. 1715. 

25 3. Available from Glen Research Cat no 1 0-1 953-xx. 

4. "Principles of Peptide Synthesis" M.Bodansky. Springer-Verlag 1984. 

5. A. J Cocuzza, Tetrahedron lett, 1989, 6287. 

6. N.N.Poluskin. B.Chen. L.W.Anderson and J.S Cohen. J. Org, Chem, 1993, 58, 4606. 
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Claims 

1 . A compound of the formula: 



5 



wherein, 

10 yns js an organic group detectable by mass spectrometry, comprising cart)on, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 
L is an organic group which allows a unique T^-containing moiety to be cleaved from the remainder of the 
compound, wherein the P"s-containing moiety comprises a functional group which supports a single ionized 
charge state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine 

IS or organic acid; 

X is a functional group selected from phosphoramidite and H-phosphonate. 

2. The compound of daim 1 wherein X is a phosphoramidite group such that P"s.l_x has the structure 



20 



25 



30 



35 



40 



45 



55 



wherein R is an alkyi group or a substituted alkyl group having one or more substituents selected from halogen 
and cyano, and the two R groups of NR2 may be bonded together to form a cydoalkyl group. 

3. The compound of claim 2 wherein X is a phosphoramidite group such that P^-L-X has the structure 



Ta5CH2— CONH— (CH2)6— O^p^Ol?, 

l!lR2 

and OR is OCH2CH2CN while NR2 is N(iso-propyl)2. 
4. The compound of claim 1 wherein X has an H-phosphonate group such that T^-L-X has the structure 



50 wherein R is a C^-Cgalkyl group. 

5. A compound according to claims 1 -4 wherein has a mass of from 1 5 to 1 0.000 daltons and a molecular formula 
of Ci^oo'^o-iooOo'ioo^io^o-io^a^p's wherein the sum of a, p and S is sufficient to satisfy the othennnse unsatisfied 
valencies of the C, N and O atoms. 

6. A compound according to claims 1-4 wherein T^ and L are twnded together through a functional group selected 
from amide, ester, ether, amine, sulfide, thioester, disulfide, thioether, urea, thiourea, carbamate, thiocarbamate, 
Schiff base, reduced Schiff base, imine. oxime, hydrazone, phosphate, phosphpnate. phosphoramide. phosphona- 
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mide, sulfonate, sulfonamide or carbon-carbon bond. 

7. A compound according to claims 1-4 wherein L is selected from L^^^, L^^id^ Lbase^ lioj^ \jr]^ lenz^ Leic^ LA and L^s, 
where actinic radiation, acid, base, oxidation, reduction, enzyme, electrochemical, thermal and thiol exchange, 
5 respectively, cause the T"^s-containing moiety to be cleaved from the remainder of the molecule. 

Sr A compound according to claims 1"-4"wherein L^^^ has the formula ti-t2-L3rwherein is a molecular fragment 
that absorbs actinic radiation to promote the deavage of from X, and and are independently a direct 
bond or an organic moiety, where separates from T^s separates from X, and neither nor 

10 undergo bond cleavage when absorbs the actinic radiation. 

9. A compound according to claim 8 wherein -L^-L^ has the formula: 



d 



20 




25 with one carbon atom at positions a, b, c, d or e being substituted with -L^-X and optionally one or more of 

positions b, c, d or e being substituted with alkyi, alkoxy, fluoride, chloride, hydroxyl, cart>oxylate or amide; and 
is hydrogen or hydrocarbyl. 

10. A compound according to claim 8 wherein is selected from a direct bond, a hydrocarbylene, -O-hydrocarbylene, 
30 and hydrocarbylene-{0-hydrocarbylene)n-H, and n is an integer ranging from 1 to 10. 

11. A compound according to claims 1-4 wherein -L-X has the formula: 




45 wherein one or more of positions b, c. d or e is substituted with hydrogen, alkyi, alkoxy. fluoride, chloride, 

hydroxyl, carboxylate or amide; is hydrogen or hydrocarbyl. and terminates In an "X" group. 

12. A compound according to claims 1-4 wherein has the formula: 

T2 is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, 
sulfur and phosphorous, having a mass of 1 5 to 500 daltons; 
55 is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, 

sulfur and phosphorous, having a mass of 50 to 1000 daltons; 

J is a direct bond or a functional group selected from amide, ester, amine, sulfide, ether, thioester, disulfide, 
thioether, urea, thiourea, carbamate, thiocarit}amate. Schiff base, reduced Schiff base, imine. oxime. hydra- 



106 



EP 0 990 047 B1 



zone, phosphate, phosphonate, phosphoramide, phosphonamide, sulfonate, sulfonamide or carbon-carbon 
bond; and 

n is an integer ranging from 1 to 50, and when n is greater than 1 , each and J is independently selected. 

13. A compound according to claim 12 wherein T2 is selected from hydrocarbyl. hydrocarbyl-O-hydrocarbylene, hy- 
drocarbyl-Srhydrocarbylene.hydrpcarbyl-NH^hydrocarbyl^n^.Jhydro^ N-(hydrocarb- 

yl)hydrocarbylene, N,N-di(hydrocarbyl)hydrocarbylene, hydrocarbylacyl-hydrocarbylene, heterocycl7lhydrocart5yr 
wherein the heteroatom(s) are selected from oxygen, nitrogen, sulfur and phosphorous, substituted heterocydyl- 
hydrocarbyl wherein the heteroatom(s) are selected from oxygen, nitrogen, sulfur and phosphorous and the sub- 
stituents are selected from hydrocarbyl, hydrocarbyl-O-hydrocarbylene, hydrocarbyl-NH-hydrocarbylene, hydro- 
carbyl-S-hydrocarbylene, N-(hydrocarbyl)hydrocarbylene. N.N-di(hydrocarbyl)hydrocarbyiene and hydrocarbyla- 
cyl-hydrocarbylene, as well as derivatives of any of the foregoing wherein one or more hydrogens is replaced with 
an equal number of fluorides. 

14. A compound according to claim 12 wherein has the formula - G(R^y, G is C^^ alkylene having a single R2 
substituent, and R2 is selected from alkyl. alkenyl, alkynyl, cydoalkyl. aryl-fused cydoalkyl, cycloalkenyl, aryl, 
aralkyi, aryl-substituted alkenyl or alkynyl, cydoalkyl-substituted alkyl, cydoalkenyl-substituted cydoalkyl, biaryl, 
alkoxy, alkenoxy. alkynoxy, aralkoxy. aryl-substituted alkenoxy or alkynoxy, alkytamino, alkenylamino oralkynylami- 
no, aryl-substituted alkylamino, aryl-substituted alkenylamino or alkynylamino, aryloxy. arylamino, N-alkylurea- 
substituted alkyi. N-ary!urea-substituted alkyl, a!kylcart)onylamino-substituted alkyl. aminocarbonyl-substituted 
alkyt. heterocyclyl. heterocydyl-substituted alkyl. heterocydyl-substituted amino, carfooxyalkyi substituted aralkyi. 
oxocarbocydyl-fused aryl and heterocydylalkyl; cydoalkenyl, aryl-substituted alkyl and, aralkyi, hydroxy-substi- 
tuted alkyl, alkoxy-substituted alkyl. aralkoxy-substituted alkyl, alkoxy-substituted alkyl, aralkoxy-substituted alkyl, 
amino-substituted alkyl. (aryl-substituted alkyloxycarbonylamino)-substituted alkyl. thiol-substituted alkyl, alkylsul- 
fonyl-substituted alkyl, (hydroxy-substituted alkylthio>-substituted alkyl. thioalkoxy-substituted alkyl. hydrocarbyla- 
cylamino-substituted alkyl, heterocydylacylamino-sut)stituted alkyl, hydrocarbyl-substituted-heterocyclytacylami- 
no-substituted alkyl, alkylsulfonylamino-substituted alkyl, arylsulfonylamino-substituted alkyl, morpholino-alkyi, tht- 
omorpholino-alkyl, morpholino carbonyl-substituted alkyl, thiomorpholinocarbonyl-substituted alkyl, (N-(alkyl, alke- 
nyl or alkynyl)- or N.N-[dialkyl, dialkenyl, dialkynyt or (alkyl, alkenyl)-amino]carbonyl-substituted alkyl, hetero- 
cyclylaminocarbonyl, heterocylylalkyleneaminocarbonyl, heterocydylaminocarbonyl-substituted alkyl, heterocyly- 

. lalkyleneaminocarbonyl-substituted alkyl, N,N-[dialkyi]alkyleneaminocarbonyl, N,N-[dialkyl]alkyleneaminocarbo- 
nyl-substituted alkyl, alkyl-substituted heterocydylcartx)nyl, alkyl-substituted heterocydylcarfx>nyt-alkyl, carboxyl- 
substituted alkyt, dialkylamino-substituted acytamlnoalkyi and amino acid side chains selected from arginine, as- 
paragine, glutamine. S-methyl cysteine, methionine and corresponding sulfoxide and sulfone derivatives thereof, 
glycine, leucine, isoleucine, allo-isoleucine, tert-leudne. norleucine, phenylalanine, tyrosine, tryptophan, proline, 
alanine, omithine, histidine. glutamine, valine, threonine, serine, aspartic add, beta-cyanoalanine, and allothreo- 
nlne; alynyl and heterocydylcarbonyl. aminocarbonyl. amido, mono- or dialkylaminocartwnyl, mono- or diarylami- 
nocarbonyl, alkylarylaminocarbonyl, diarylaminocarbonyl, mono- or diacytaminocarbonyl, aromatic or aliphatic 
acyl, alkyl optionally substituted by substituents selected from amino, carboxy, hydroxy, mercapto, mono- or di- 
alkylamino. mono- or diarylamino, alkylarylamino, diarylamino. mono- or diacylamino. alkoxy, alkenoxy. aryloxy, 
thioalkoxy. thioalkenoxy. thtoalkynoxy. thioaryloxy and heterocydyl. 

15. A compound according to claim 12 having the formula: 



1^ 
I 

Amide 




wherein 
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G is (CHj),^ wherein a hydrogen on one and only one of the CH2 groups is replaced with-(CH2)c-Amide-T^; 
T2 and are organic moieties of the formula C,.25No.gOo.gH„Fp wherein the sum of o and p is sufficient to 
satisfy the otherwise unsatisfied valencies of the C, N, and O atoms; 



o o 

ti n 

Amide is— N-C — or — C-N— ; 

r1 R» 



10 



is hydrogen or C^.^q aikyi; 
c is an integer ranging from 0 to 4; 
X is deflned according to claim 1 ; and 
'5 n is an integer ranging from 1 to 50 such that when n is greater than 1 , G, c. Amide, and T"* are independently 

selected. 

16. A compound according to daim 15 having the formula: 

20 

I 

Amide 
I 



30 



I' 0 (CHJk 



Amide 

35 



wherein is an organic moiety of the formula C^.25No,90o.9Hjj(Fp wherein the sum of a and p is sufficient 
to satisfy the otherwise unsatisfied valencies of the C, and O atoms; and includes a tertiary or quaternary 
^0 amine or an organic acid; and m Is an integer ranging from 0-49. 

17. A compound according to claim 15 having the formula: 



45 



t 

Anude 

(CH2)c O 



50 




55 



Atnkle 



wherein is an organic moiety of the formula Ci.25^o-9^o-9'~^a^p wherein the sum of a and p is sufficient 
to satisfy the otherwise unsatisfied valencies of the C. N. and O atoms; and includes a tertiary or quaternary 
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amine or an organic acid; and m is an integer ranging from 0-49. 
18. A compound according to any one of claims 16 and 17 wherein -Amide-T^ is selected from: 



10 



15 



20 



25 



O I 



— NHC-Y N-(C,~Cio); and — NHC— (C,— C|o)-N 



19. A compound according to any of daims 16 and 17 wherein -Amide-T^ is selected from: 



30 N— X 



CNH-(C,— Cio)— ^ y . — CNH— (Cg-Cio)-N^ O ; 



O 



35 



(c,— Cio) (Cr-c,o) 



I 



— CNH— (CV-Cioh-N \ ; — CNH— (C,— Cio)-^"'"^ ; 

40 5 — O L-^ 1 

,5 -— CNH— (C2-C,o)-N(C,-C,o)2: — CNH-(C2r-C,o)-N ; 

O O ^ — ' 



50 




-Ct/ V(Ci— Clo) ; and 
H \ / 



55 20. A compound according to any one of claims 14-16 wherein has the structure which results when one of the 
following organic acids Is condensed with an amine group to form T2-C(=0)-N(Ri)-: Formic acid. Acetic acid, 
Piopiolic acid. Propionic acid, Ruoroacetic acid, 2-Butynoic acid, Cyclopropanecartwxylic acid. Butyric acid, Meth- 
oxyacetlc acid, Oifluoroacetic acid, 4-Pentynolc acid, Cyclot>utanecart)oxylic acid. 3,3-Dimethylactylic acid. Valeric 
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acid, N,N-Dimethylglycine, N-Formyl-Gly-OH, Ethoxyacetic acid, (Melhylthio)acetic acid, Pyrrole-2-cart)oxylic ac- 
id, 3-Furoic acid, lsoxazole-5-carboxylic acid. trans-3-Hexenoic acid, Trifluoroacetic acid, Hexanoic acid. Ac- 
Gly-OH, 2-Hydroxy-2-methyIbutyric acid. Benzoic acid. Nicotinic acid. 2-Pyrazinecarboxylic acid, 1 -Methyl-2-pyr- 
rolecarboxylic add, 2-Cyclopentene-1 -acetic acid, Cydopentylacetic acid, (SH-)-2-Pyrrolidone-5-carboxylic acid, 
N-Methyl-L-pro)ine, Heptanoic acid, Ac-b-Ala-OH. 2-Ethyl-2-hydroxybutyric acid, 2-(2-Methoxyethoxy)acetic acid, 
- -PJCpjujc acid, e^Mettiyfni^tjnic^acid, 5-Methyl-2-pyrazinecaftx)xylic add, 2,5-DimelhyIpyrrole-3-cart)Oxylic acid, 
4-Fluorobenzoic acid, 3,5-D(methylisoxazole-4-carboxyfic~acid,~3-C^ 'acid,"Nr 
N-Dimethylsuccinamic acid, Phenylpropiolic acid, Cinnamic acid, 4-Elhylbenzoic acid, p-Anisic acid, 1 ,2,5'Trimeth- 
ylpyn-ole-3-carboxylicacid, 3-Ruoro-4-methylbenzoic add, Ac-DL-Propargylglycine, 3-(Trifluoromethyl)butyric ac- 
id, 1-Piperidinepropionicacid, N-Acetylproline, 3,5-Difluorobenzoic acid, Ac-L-Val-OH, lndole-2-carboxylic acid, 
2-6enzofurancarboxylic add. Benzotriazole-5-carboxylic acid. 4-n-Propylbenzoic acid, 3-Dimethylaminobenzoic 
acid, 4-Ethoxybenzoicadd. 4-(Methylthio)benzoic acid, N-(2-Furoyl)glycine, 2-(Methyithio)nicotinicacid. 3-Fluoro- 
4-methoxybenzoic acid. Tfa-Gly-OH, 2-Napthoic acid, Quinaldic acid, Ac-L-lle-OH, 3-Methyllndene-2-carboxylic 
acid. 2-Quinoxalinecarboxylic acid, 1-Methylindole-2-carboxylic acid, 2,3,6'Trifluorobenzoic acid, N-Formyl- 
L-Met-OH, 2-[2-(2-Melhoxyethoxy)ethoxy]acetic acid. 4-n-Butylbenzoic acid, N-Benzoylglycine, 5-Fluoroindole- 
2-cartx>xylic add. 4-n-Propoxybenzoic acid. 4-Acetyl-3,5-dimethyl-2-py[Tolecarboxyltc add. 3.5-Dimethoxybenzo- 
icacid, 2,6-Dimelhoxynicotinic acid, Cydohexanepentanoic acid, 2-Naphthyiacetlcacid. 4-(1 H-PyrroM-yl)benzoic 
acid, lndole-3-propionic acid, m-Trifluoromethylbenzoic add, 5-Methoxyindole-2-cart)bxylic acid, 4-Pentyibenzoic 
acid. Bz-b-Ala-OH. 4-Diethylaminobenzoic acid, 4-n-Butoxybenzoic acid, 3-Methyl-5-CF3-isoxazole-4-carboxylic 
add. (3,4-Dimethoxyphenyl)acetic acid, 4-Biphenylcarboxylic add, Pivaloyl-Pro-OH, Octanoyl-Gly-OH, (2-Naph- 
thoxy)aceticacid. lndole-3-butyricacid. 4-(Trifluoromethyl)phenylaceticacid. 5-Methoxyindole-3-aceticacid, 4-(Tri- 
fluoronfiethoxy)benzoic acid, Ac-L-Phe-OH. 4-Pentyloxybenzoic add, Z-Gly-OH, 4-Carboxy-N-{fur-2-ylmethyl)pyr- 
rolidin-2-one. 3,4-Diethoxybenzoic acid, 2,4-Dimethyl-5-C02Et-pyrrole-3-cartx)xylic add, N-(2-Fluorophenyl)suc- 
cinamic acid, 3,4,5-Trimethoxybenzoicacid, N-Phenylanthranilic acid, 3-Phenoxybenzoicacid, Nonanoyi-Gly-OH, 

2- Phenoxypyridine-3-carboxylic acid, 2,5-Dimethyi-1-phenylpyn'ole-3K:arboxylic acid, trans-4-(Trifluoromettiyl) 
cinnamic acid, (5-Methyl-2-phenyloxazol-4-yl)acetic acid. 4-(2-Cyclohexenyloxy)benzoic acid. 5-Methoxy-2-meth- 
yl(ndole-3-acetic add, trans-4-Cotininecarboxylic acid, Bz-S-Aminovaleric add, 4-Hexyloxybenzoic acid. 
N-(3-Methoxyphenyl)sucdnamic add, Z-Sar-OH, 4-(3,4-Dimelhoxyphenyl}butyric acid, Ac-o-Fluoro-DL-Phe-OH, 
N-(4-Fluorophenyl)glutaramic acid, 4'-Ethyl-4-bipheny!carboxylic acid, 1,2,3,4-TetrahydroacridinecarboxyIic acid, 

3- Phenoxyphenylacetic acid, N-(2,4-Difluorophenyl)succinamic acid. N-Decanoyl-Gly-OH, (+)-6-Methoxy-a-me- 
thyl-2-naphthaleneacetic add, 3-(Trifluoromethoxy)cinnamicacid, N-Formyl-DL-Trp-OH, (RH+)-a-Methoxy-a-(tri- 
fluoromethyOphenylacetic acid, Bz-DL-Leu-OH, 4-{Trifluoromethoxy)phenoxyacetic add, 4-Heptyloxybenzoic ac- 
id, 2,3,4-Trimethoxycinnamic acid, 2,6-Dimethoxybenzoyl-Gly-OH, 3-(3,4,5-Trimethoxyphenyl)propionic acid, 
2,3,4,5,6-Pentafluorophenoxyacetic acid, N-(2,4-Difluorophenyl)glutarainic acid, N-Undecanoyl-Gly-OH, 

2- (4-Fluorobenzoyl)benzoic acid, 5-Trifluoromethoxyindole-2-carboxy!ic acid, N-(2,4-Difluorophenyl)diglycolamic 
add, Ac-L-Trp-OH. Tfa-L-Phenyiglycine-OH, 3-lodobenzoic acid, 3-(4-n-Pentylbenzoyl)propionic acid, 2-Phenyl- 

4- quinolinecarboxylic add. 4-Octyloxybenzoic acid, Bz-L-Met-OH. 3.4,5-Triethoxybenzoic acid, N-Lauroyl- 
Gly-OH. 3,5-Bis(trifluoromethyl)benzoic acid, Ac-5-Methyl-DL-Trp-OH. 2-lodophenylacetic acid, 3-lodo-4-methyl- 
benzoic add, 3-{4-n-Hexylbenzoyl)propionic acid, N-Hexanoyl-L-Phe-OH , 4-Nonyloxybenzoic acid , 4'-{Trifluorome- 
thyl)-2-biphenylcarboxylic acid, Bz-L-Phe-OH, N-Tridecanoyl-Gly-OH, 3,5-Bis(trifluorDmethyl)phenylacetic acid, 

3- (4-n-Heptylbenzoyl)propionicacid, N-Hepytanoyl-L-Phe-OH. 4-Decyloxybenzoic acid, N-(a,a,a-trifluoro-m-tolyt) 
anthranilic add, Niflumic add. 4-(2-Hydroxyhexafluoroisopropy1)benzoic acid. N-Myristoyl-Gly-OH. 3-(4-n-Octyl- 
benzoyl)proplonic acid, N-Octanoyl-L-Phe-OH, 4-Undecyloxybenzoic acid, 3-(3.4,5-Trimethoxyphenyl)propionyt- 
Gly-OH, 8-lodonaphtholc acid, N-Pentadecanoyl-Gly-OH, 4-Dodecyloxybenzoic acid. N-Palmitoyl-Gly-OH, and 
N-Stearoyl-Gly-OH. 

21. A method for determining the presence of a single nucleotide polymorphism in a nudeic acid target comprising: 

a) amplifying a sequence of a nudeic acid target containing a single nudeotide polymorphism; 

b) generating a single strand fomi of the target; 

c) combining a tagged nucleic acid probe with the amplified target nudeic acid molecules under conditions 
and for a time sufficient to permit hybridization of said tagged nudeic acid probe to complementary amplified 
selected target nudeic acid molecules, wherein said tag is correlative with a particular single nucleotide pol- 
ymorphism and is detectable by spectrometry or potentiometry; 

d) separating unhybridized tagged probe from hybridized tagged probe by a sizing methodology; 

e) cleaving said tag from said probe; and 

0 detecting said tag by spectrometry or potentiometry, and determining the presence of said single nudeotide 
polymorphism, 
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wherein the tagged nucleic acid probe has the structure 



5 

wherein, 

T"^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine: 
^0 L Is an organic group which allows a unique T"^-containing moiety to be cleaved from the remainder of the 

compound, wherein the T^-containing moiety, comprises a functional group which supports a single ionized 
charge state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine 
or organic acid; and 
X" Is nucleic add. 

15 

22. A method for determining the presence of a single nucleotide polymorphism in a nucleic acid target comprising: 



a) amplifying a sequence of a nucleic acid target containing a single nucleotide polymorphism; 

b) combining a tagged nucleic acid primer with the amplified target nucleic acid molecules under conditions 
20 and for a time sufficient to permit annealing of said tagged nucleic acid primer to complementary amplified 

selected target nucleic acid molecules, wherein the oligonucleotide primer has a 3'-most base complementary 
to the wildtype sequence or the single nucleotide polymorphism, wherein said tag is correlative with a particular 
single nucleotide polymorphism and is detectable by spectrometry or potentiometry; 

c) extending the primer wherein a complementary strand to the target Is synthesized when the 3*-most baseof 
25 the primer Is complementary to the target; 

d) separating unextended tagged primer fi^om extended tagged primer by a sizing methodology: 

e) cleaving said tag from said primers or extended primers; and 

f) detecting said tag by spectrometry or potentiometry, and determining therefrom the presence of said single 
nucleotide polymorphis, 

30 

wherein the tagged nucleic add primer has the structure 



35 



wherein. 



P"^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and Iodine; 
40 L Is an organic group which allows a unique T"^-containing moiety to be cleaved from the remainder of the 

compound, wherein the T"^-containing moiety comprises a functional group which supports a single ionized 
charge state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine 
or organic acid; and 
X' is nudeic acid. 

45 

23. A method for detemiining the quantity of a specific mRNA molecule in a nudeic acid population comprising: 



a) converting an RNA population Into a cDNA population; 

b) adding a single strand nucleic add (internal standard) containing a plurality of single nucleotide potymor- 
50 phisms, that is otherwise identical to said cONA target; 

c) amplifying a specific sequence of said cDNA target; 

d) coamplifying the internal standard, wherein said internal standard is the same length as the cDNA amplicon; 

e) generating a single strand form of the target; 

f) combining a set of tagged nudeic acid probes with the amplified target cDNA and amplified internal standard 
55 under conditions and for a time sufficient to permit hybridization of said tagged nudeic acid prot)e to comple- 
mentary selected target cDNAand intemal standard sequences, wherein said tag Is correlative with a particular 
cDNA sequence and a second tag is correlative with the intemal standard, and is detedable by spectrometry 
or potentiometry; 



111 



EP 0 990 047 B1 



g) separating unhybridized tagged probe from hybridized tagged probe by a sizing methodology: 

h) cleaving said tag from said probes; 

I) detecting said tags by spectrometry or potentiometry; and 

j) taking the ratio of tag con-elated to cDNA to tag correlated with the internal standard, and determining there- 
from the quantity of said cDNA, thereby determining the quantity of the specific mRNA in a nucleic acid pop- 
ulation, 



wherein the tagged nucleic acid probe has the structure 



wherein, 

T"^^ is an organic group detectable by mass spectrometry, comprising cart>on, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 

L is an organic group which allows a unique T^^s-containing moiety to be cleaved from the remainder of the 

compound, wherein the T^^-containing moiety comprises a functional group which supports a single Ionized 

charge state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine 

or organic acid; and 

X* Is nucleic acid. 

24. A method for determining the quantity of a single nucleotide polymorphism in a nucleic acid target comprising: 

a) amplifying a sequence of a nucleic acid target containing a single nucleotide polymorphism; 

b) generating a single strand form of the target; 

c) combining a tagged nucleic acid probe with the amplified target nucleic acid molecules under conditions 
and for a time sufficient to pemnit hybridization of said tagged nucleic acid probe to complementary amplified 
selected target nucleic acid molecules, wherein said tag is correlative with a particular single nucleotide pol- 
ymorphism and is detectable by spectrometry or potentiometry; 

d) separating unhybridized tagged probe from hybridized tagged probe by a sizing methodology; 

e) cleaving said tag from said probes; 

f) detecting said tags by spectrometry or potentiometry; and 

j) taking the ratio of tag con-elated to the wild type polymorphism to the tag correlated with the mutant poly- 
morphism, and determining therefrom the quantity of said polymorphism, 

wherein the tagged nucleic acid has the structure 
wherein, 

T"^ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 

L is an organic group which allows a unique T"^s-containing moiety to be cleaved from the remainder of the 

compound, wherein the T'^s-containing moiety comprises a functional group which supports a single ionized 

charge state when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine 

or organic acid; and 

X* is nucleic acid. 



Revendications 

1 . Compost de formule : 



112 



EP 0 990 047 B1 



dans laquelle 

T"^s represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone. au 
moins un des elements consistant en hydrogens et fluorure. des atomes facultatifs choisis entre Toxyg^ne. 
Tazote, le soufre, le phosphore et I'iode ; 

_L represente.un.groupejorgankiuejquLperrne^ contenant T""® unique d'etre cllv6 du reste du 

compose, le groupement contenant T^^ comprenant un groupe fonctionnel qui assur^un etat'de cKarge'id^ 
nisee unique lorsque le compose est soumis a une spectrometrie de masse et qui consiste en une amine 
tertiaire, une amine quatemaire ou un acide organique ; 

X represente un groupe fonctionnel choisi entre un phosphoramidite et H-phosphonate. 

Compose suivant la revendication 1 , dans lequel X represente un groupe phosphoramidite de sorte que P"s,l.x 
a la structure 

dans laquelle R represente un groupe alkyle ou un groupe alkyle substitue comprenant un ou plusieurs 
substituants choisis entre des substituants halogeno et cyano. et les deux groupes R de NR2 peuvent etre lies 
I'un a I'autre pour former un groupe cycloalkyle. 

Compose suivant la revendication 2. dans lequel X represente un groupe phosphoramidite de sorte que P"s.l.x 
a la structure 

TS^CHr-CONH— (CH2)€-Q^^Ol^ 

et OR represente un groupe OCHjCHjCN. tandis que NRg represente un groupe N(isopropyle)2. 

Compose suivant la revendication 1 . dans lequel X possede un groupe H-phosphonate de sorte que 7^-L-X a la 
structure 

O 

k 

dans laquelle R represente un groupe alkyle en C^ a Cq. 

Compose suivant les revendications 1 a 4, dans lequel "r"^ g une masse de 15 ^ 10 000 daltons et la fomiule 
moieculaire C1.500N0-100O0-100S0.10P0-10 Vp's ^^^^ laquelle la somme de a. p et 5 est suffisante pour satisfaire 
les valences par ailleurs non satlsfaites des atomes de C. N et O. 

Compose suivant les revendications 1 a 4, dans lequel T^^ gt L sont lies I'un ^ I'autre par un groupe fonctionnel 
choisi entre des groupes amide, ester, ether, amine, sulture, thioester, disulfiire, thioether, uree, thio-uree. carba- 
mate, thiocarbamate. base de Schiff, base de Schiff reduite, imine, oxime, hydrazone. phosphate, phosphonate, 
phosphoramide, phosphonamide, sulfonate et sulfonamide ou une liaison carbone-carbone. 

Compose suivant les revendications 1 a 4, dans lequel L est choisi entre L^v tadtse^ L^ase^ iio] l(R]. L^nz. L^Jc, 
et L«s, un rayonnement aclinique, un acide, une base, une oxydation, une reduction, une enzyme, une reaction 
eiectrochimique, une reaction thermique et un echange de groupe thiol, respectivement, provoquant le clivage du 
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groupe contenant du reste de la mol6cule. 

8. Compose suivant les revendications 1 a 4, dans lequel L^^ repond a la formule -L^-L^, dans laquelle represente 
un fragment moleculaire qui absorbe le rayonnement actinique pour favoriser le clivage de T"^ de X et et 
5 representent, independamment, une liaison directe ou un groupement organique. U separant de T"^s q{ \J 
se parant de jiij.^ ni ne subissant un clivage de liaison lorsque absorbe le rayonnement actinique. 



9. Compose suivant la revendication 8, dans lequel -L^-L^ repond a la formule : 

10 




20 avec un atome de carbone en position a. b, c, d ou e substitue avec -L^-X et, facultativement, une ou plusieurs 

des positions b. c, d ou e substitutes avec des substituants alkyle. alkoxy, fluorure. chlorure, hydroxyle, carboxylate 
ou amide ; et represente un atome d'hydrogene ou un groupe hydrocarbonyle. 

10. Compost suivant la revendication 8, dans lequel est choisi entre une liaison directe, un groupe hydrocarbylene, 
25 un groupe -0-hydrocarbylene et un groupe hydrocarbylene-(0-hydrocarbyiene)n-H et n represente un nombre 

entierde 1 a 10, 

11. Compost suivant les revendications 1 d 4, dans lequel -L-X rtpond k la formule : 

30 

d 



35 




dans laquelle une ou plusieurs des positions b, c, d ou e sont substitutes avec un atome d'hydrogene. un 
groupe alkyle, alkoxy. fluorure, chlorure, hydroxyle, carboxylate ou amide ; reprtsente un atome d*hydrogtne 
ou un groupe hydrocarbyle, et R^ se termine par un groupe "X". 

45 

12. Compose suivant les revendications 1 a 4. dans lequel rtpond a la fomnule : 

50 

T2 represente un groupement organique forme de cartjone et d'un ou plusieurs des eltments consistant en 
hydrogene, fluorure, iodure. oxygtne, azote, soufre et phosphore, ayant une masse de 1 5 a 500 daltons 

represente un groupement organique formt de carbone et d'un ou plusieurs des titments consistant en 
hydrogene. fluorure, iodure, oxygtne, azote, soufre et phosphore, ayant une masse de 50 ^ 1000 daltons ; 
55 J reprtsente une liaison directe ou un groupe fonctionnet choisi entre des groupes amide, ester, amine, sulfiire. 

ether, thioester, disulfure, thiotther, urte. thio-urte, carbamate, thiocarbamate, base de Schiff, base de Schiff 
rtduite, imine, oxime. hydrazone. phosphate, phosphonate. phosphoramide, phosphonamide. sulfonate, sul- 
fonamide ou une liaison cartone-carbone ; et 
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n repr^sente un nombre entier de 1 d 50 et, lorsque n est superieur d 1 . chacun des groupes et J est choisi. 
ind^pendamnnent. 

13. Compose suivant ia revendlcation 12, dans lequel est choisi entre des groupes hydrocarbyle, hydrocarbyl-0- 
hydrocarbyl^ne, hydrocarbyl-S-hydrocarbylene. hydrocarbyl-NH-hydrocarbylene, hydrocarbyl-amide-hydrocarby- 

lene..Nt(hydroc»rbyl)hydrpj:art)ylene,_N,N^ hete- 

rocyclyihydrocarbyle dans lequel le ou les heteroatomes sont choisis entre des~atorrreFd'bxygen^ 
soufre et de phosphore, heterocydylhydrocarbyie substitue dans lequel le ou les heteroatomes sont choisis entre 
des atomes d'oxygene, d'azote, de soufre et de phosphore et les substituants sont choisis entre des substituants 
hydrocarbyle, hydrocarbyl-O-hydrocarbylene, hydrocarbyt-NH-hydrocarbylene, hydrocarbyl-S-hydrocarbyiene, 
N-(hydrocarbyI)hydrocarbyl6ne, N,N-di(hydrocarbyl)hydrocarbyl§ne et hydrocarbylacyl-hydrocarbylene, ainsi que 
des derives de n'importe lesquels des precedents dans lesquels un ou plusieurs atomes d*hydrog6ne sont rem- 
places par un meme nombre de fluorures, 

14. Compose suivant la revendlcation 1 2, dans lequel repond a la fonnule -G(R2)-. G represente un groupe alkylene 
en Ci a Cq portant un seul substituant R2, et R2 est choisi entre des groupes alkyle, alc^nyle. alcynyle, cydoalkyle. 
cycloalkyle condens6 avec un groupe aryle, cycloalc6nyle, aryle, aralkyle, alcenyle ou alcynyle substitue avec un 
groupe aryle, alkyle substitue avec un groupe cycloalkyle, cycloalkyle substitu6 avec un groupe cycloalcenyle, 
biaryle, alkoxy, alcenoxy, alcynoxy, aralkoxy, alcenoxy ou alcynoxy substitue avec un groupe aryle. alkylamino, 
alcenylamino ou alcynylamino. alkylamino substitue avec un groupe aryle, alcenylamino ou alcynylamino substitu6 
avec un groupe aryle, aryloxy. arylamino, alkyle a substituant N-alkyluree, alkyle a substituant N-arylur6e. alkyle 
a substituant alkylcarbonyiamino, alkyle a substituant aminocarbonyle, heterocyclyle, alkyle ^ substituant hetero- 
cyclyle, amino d substituant heterocyclyle, aralkyle ^ substituant carboxyalkyle, aryle condense avec un groupe 
oxocarbocyclyle et h6t6rocyclylalkyle ; cycloalcenyle, alkyle a substituant aryle et aralkyle, alkyle a substituant 
hydroxy, alkyle d substituant alkoxy, alkyle d substituant aralkoxy, alkyle a substituant alkoxy, alkyle a substituant 
aralkoxy. alkyle d substituant amino, alkyle d substituant (alkyloxycarbonylamino a substituant aryle), alkyle a 
substituant thiol, alkyle a substituant alkylsulfonyle, alkyle d substituant (alkylthio a substituant hydroxy), alkyle a 
substituant thioalkoxy, alkyle d substituant hydrocarbylacylamino, alkyle a substituant h6t6rocyclylacylamino, alk- 
yle a substituant heterocyclylacylamino a substituant hydrocarbyle, alkyle a substituant alkylsulfonylamino, alkyle 
a substituant arylsulfonylamino, morpholino-alkyle, thiomorpholino-alkyle, alkyle a substituant morpholinocarbo- 
nyle. alkyle a substituant thiomorpholinocart)onyle, alkyle a substituant [N-{alkyl, alcenyl ou alcynyl)- ou N,N-[dialk- 
yl, dialc§nyl, dialcynyl ou (alkyi, alcenyl)-amino]carbonyle, heterocyclylaminocartwnyle, alkyle a substituant hete- 
rocyclylaminocarbonyle, alkyle ^ substituant h6terocyclylalkylene-am?nocarbonyle, N,N-[dialyI]alkyl6ne-aminocar- 
bonyle, alkyle a substituant N,N-[diaIkyl]alkylene-aminocarbonyle, heterocyclylcarbonyle a substituant alkyle, h6- 
terocyclylcarbonylalkyle a substituant alkyle, alkyle a substituant cartjoxyle, acylamino-alkyle ^ substituant dialk- 
ylamino et des chaines Iat6rales d'amine-acides choisies entre Targinine, I'asparagine, la glutamine, la S-methyl- 
cyst6ine. la methionine et ses derives correspondants consistant en sulfoxydes et sulfones, la glycine, la leucine, 
risoleucine, rallo-isoleucine. la tertio-leucine. la norleucine, la phenylalanine, la tyrosine, le tryptophane, la proline, 
I'alanine, Tornithine, Thistidine, la glutamine, la valine, la threonine, la serine, I'acide aspartique, la beta-cyano- 
alanine et I'allothreonine ; alcynyle et heterocyclylcarbonyle, aminocarbonyle, amido, mono- ou dialkylaminocar- 
bonyle, mono- ou dlarylaminocart)onyle, alkylarylaminocartK)nyIe, diarylaminocartxDnyle, mono- ou diacylamino- 
carbonyle, acyle aromatique ou' aliphatique, alkyle facultatlvement substitue avec des substituants choisis entre 
des substituants amino, carboxy, hydroxy, mercapto, mono- ou dialkylamino, mono- ou diarylamino. alkylarylamino, 
diarylamino, mono- ou diacylamino, alkoxy, alcenoxy, aryloxy, thioalkoxy, thioalcenoxy, thioalcynoxy, thioaryloxy 
et heterocyclyle, 

15. Compose suivant la revendlcation 12, de formule : 
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10 



15 



I 

Aaaie 

I 

o„j;Wl__ 



dans laquelle 



G represente un groupe (CH2)i^ dans lequei un atome d'hydrogene sur un et un seul des groupes CH2 est 
remplace par un groupe -(CH2)c-amide-T^ ; 

et repr^sentent des groupements organiques de formule C^.25No.90o.9Hq^Fp dans laquelle la somme de 
a et p est suffisante pour satisfaire les valences par ailleurs non satisfaites des atomes de C. N et O ; 
20 I'amide repond a la formule 

O o 

0 n 

25 Amide ^-^N-C—^^ ^C-N-; 



repr6sente un atome d'hydrogene ou un groupe alkyle en a C^o ; 
30 c repr6sente un nombre entier de 0 a 4 ; 

X repond a la definition suivant la revendication 1 ; et 

n represente un nombre entier de 1 a 50 de telle sorte que, lorsque n est superieur a 1 , G, c, I'amide, et 
solQnt choisis ind^pendamment. 

35 16. Compost suivant la revendication 1 5» r^pondant a ia formule 



40 



45 




50 

dans laquelle represente un groupementorganique de formule C^.25No.90o,gHo^Fp dans laquelle la somme 
de a et p est suffisante pour satisfaire les valences par ailleurs non satisfaites des atomes de C, N et O ; et 
comprend une amine tertiaire ou quatemaire ou un acide organique ; et m represente un nombre entier de 0 a 49. 

55 

17. Compost suivant la revendication 15. de formule : 
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dans iaquelie repr6sente un groupement organique de formule Ci.25No.90o.9H„Fp dans laquelle la somme 
de a et p est suffisante pour satisfaire !es valences par ailleurs non satisfaites des atomes de C, N et O ; et 
comprend une. amine tertiaire ou quatemaire ou un acide organique ; el m represente un nombre-entier de 0 a 49. 

1 8. Compose suivant Tune quelconque des revendications 1 6 et 1 7, dans iequel le groupe -amide-T^ est choisi entre : 

Pi — Cio) 

-NHc-cc,-Cjo)-/^ ; ~^M~(^^ • 

o o L I 

19. Compost suivant i*une quelconque des revendications 16 et 17, dans Iequel le groupe -amide-T^ est choisi entre 

— CNH-(C,-Cio>— ( ~\. — cnh~(c,-c,o)-n'^ ^^: . 
o o — / 
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"CNH~(Cj-C,oHN(CrCio)2 ; — 94H--(Cr-C,o)~/~l - 
6 A \— I" 



—a/ \i 





20. Compose suivant Tune quelconque des revendlcations 14 a 16. dans lequel a la structure qui est obtenue 
iorsqu'un des acides organiques suivants est condense avec un groupe amine pour fbmier un groupe T2-C(=0) 
-N(Ri)- : acide formique. acide acetique, acide propioilque, acide propionique, actde fluoracetique, acide 2-buty- 
noique. acide cydopropane-carboxylique, acide butyrique, acide methoxyac6tique. acide difluoracetique, acide 
4-pentynoYque, acide cydobutane-carboxylique, acide 3,3-dim6thylacrylique, acide valerique, N,N-dim6thylglyci- 
ne. acide N-fomriyl-Gly-OH. acide ethoxyacetique, acide (methylthio)acetique, acide pyrrol e-2-carboxylique, acide 
3-furo(que, acide isoxazole-5-carboxy(ique, acide trans*3-hex6noYque, acide trifluoracetique, acide hexanoique, 
Ac-Gly-OH, acide 2-hydroxy-2-m^thylbutyrique, acide benzoTque, acide nicotinique, acide 2-pyrazinecarboxylique, 
acide 1-methyl-2-pyrro!ecarboxy!ique, acide 2-cyclopentene-1 -acetique, acide cydopentylac^tique, acide (SH-)- 

3- pyn^olidone-5-carboxylique, N-methyl-L-proline, acide heptanoique, Ac-b-Ala-OH, acide 2-ethy1-2-hydroxybuty- 
rique, acide 2-(2-methoxy6thoxy)acetique, acide p-toluique, acide 6-methylnicotinique, acide 5-m6thyl-2-pyra2i- 
necart>oxylique, acide 2,5-dimethylpyrrole-3-carboxylique, acide 4-fluorobenzoTque, acide 3,5-dimethylisoxazole- 

4- carboxylique, adde 3-cydopentylpropionique, acide octanoTque, acide N,N-dim6thylsuccinamique, adde ph^- 
nylpropiolique, acide cinnamique. acide 4-6thylbenzoique, acide p-anisique, acide 1.2,5-trlmethylpyrrole-3-car- 
boxylique. acide 3-fluoro-4-m6thylbenzoTque, Ac-DL-proparylglycine, acide 3- (trifluorom6thy1)butyrique, acide 

1- piperidinepropionique, N-acetyl proline, acide 3,5-difluorobenzoYque, Ac-L-Val-OH, acide indole-2-carboxylique, 
acide 2-benzofurannecarboxylique, acide benzotriazole-5-carboxylique, acide 4-n-propylbenzoique, acide 3-di- 
methylaminobenzoTque. acide 4-ethoxybenzoique, acide 4- (methylthio)benzoique, N-(2-furoyl)-glycine. acide 2- 
(m6thylthio)nicotinique, acide 3-fluoro-4-methoxybenzoTque. Tfa-Gly-OH, adde 2*naphtoique, acide quinaldique, 
Ac-L-ile-OH. adde 3-methylindene'2'Carboxylique, adde 2-quinoxalinecart)oxylique, adde 1-m6thylindole-2-car- 
boxylique, acide 2,3,6-trifluorobenzoTque, N-formyl-L-Met-OH, acide 2-[2-(2-m6thoxy6thoxy)-6thoxy]ac6tique, aci- 
de 4-n-butylbenzoique. N-benzoylglycine, acide 5-fluoro-indole-2-carboxylique, acide 4-n-propoxybenzoique, aci- 
de 4-ac6tyl-3.5-dimethyi-2-pyrrolecarboxylique, adde 3,5-dim6thoxybenzoTque, acide 2,6-dim6thoxynicotinique, 
adde cydohexanepentanoTque, adde 2-naphtylacetique. acide 4-(1H-pyrrole-1-yl)b6nzoTque, adde indole-3-pro- 
pionique. acide m-trifluoromethylbenzoTque, acide 5Hiiethoxyindole-2>carboxylique, acide 4-pentytbenzoTque, 6z- 
b-Ala-OH, acide 4-di6thylaminobenzoyque. acide 4-n-butoxybenzoTque, acide 3-methy!-5-CF3-isoxazole-4-car- 
boxylique, acide (3,4-dimethoxyph6nyl)-ac6tique, acide 4-biph6nylcarboxylique, pivaloyl-Pro-OH, octanoyl- 
G!y-OH, acide (2-naphtoxy)acetique, acide indole-3-butyrique, acide 4-(trif)uorom^thyi)ph6nylacetique, acide 

5- m^thoxyindole-3-ac^tique, acide 4-(trifluoromethoxy)4)enzoTque. Ac-L-Phe-OH, acide 4-pentyloxybenzoTque. 
Z-Gly-OH. 4-carlx)xy-N-(fur-2-ylmethyl)pyrrolidlne-2-one, adde 3,4-di6thoxybenzoTque. acide 2,4-dim6thyl- 

5- C02Et-pyrrole-3-carboxylique, acide N-(2-fluorophenyl)-succinamique, acide 3,4,5-trimethoxybenzoique, acide 
N-phenylanthranilique, acide 3-ptienoxybenzoTque, nonanoyl-Gly-OH, acide 2-phenoxypyridine-3-carboxylique, 
acide 2,5-dimethyl-1-phenylpyrro!e-3-carboxylique, acide trans-4-{trifluorom6thyl)cinnamique, acide (5-m6thyl- 

2- phenyloxazole-4-yl)ac6tique, acide 4-(2-cydohexenyloxy)-benzoique, adde 5-methoxy-2-methylindole-3-ac6ti- 
que. acide trans-4-cotininecarboxylique, acide Bz-5-aminoval6rique, adde 4-hexyloxybenzoYque, adde N- (3-m6- 
thoxyphenyO-succinamique, Z-Sar-OH. adde 4-(3,4-dimethoxyph6nyl)-butyrique, Ac-o-f!uoro-DL-Phe-OH, acide 
N-(4-fluoroph6nyl)-glutaramique, acide 4*-ethyl-4-biphenylcart>oxylique, acide 1 .2,3,4-letrahydro-acridinecarboxy- 
lique, acide 3-phenoxyphenylacetique, acide N-(2,4-difluoroph6nyl) succinamique, N-decanoyi-Gly-OH, acide (+)- 

6- m6thoxy-a-mettiyl-2-naphtalene-ac6tique. adde 3-{trifluorom6thoxy)dnnamique, N-formyl-DL-Trp-OH, acide 
(RH+)- a-m6thoxy-a-(trifluorom6thyl)ph6nylac6tique. Bz-DL-Leu-OH, acide 4-(tfifluorom6thoxy)ph6noxyac6ti- 
que, acide 4-hepty1oxybenzoTque, acide 2.3,4-trimethoxycinnamique, 2,6-dimethoxybenzoyl-Gly-OH, acide 

3- {3,4,5-trimethoxyphenyl)propionique, adde 2,3,4,5,6-pentafluorophenoxyac6tique, acide N-(2,4-dl-fluoroph6- 
nyt)glutaramique, N-und^canoyl-Gly-OH. adde 2- (4-fluorobenzoyl)benzo'ique, acide S-trifluorom^thoxyindole- 
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2- carboxylique, acide N-(2.4-difluorophenyl) diglycolamique, Ac-L-Trp-OH. Tfa-L-phenylglycine-OH, acide 3-iodo- 
benzoique, acide 3-(4-n-pentylbenzoyl)propionique, acide 2-phenyl-4-quinoleinecarboxylique, acide 4-octyloxy- 
benzoique. Bz-L-Met-OH, acide 3.4,5-tri^thoxybenzoTque. N-lauroyl-Gly-OH, acide 3,5-bis(trifluorom6thyl)benzoi- 
que, Ac-5-methyl-DL-Trp-OH, acide 2-iodophenylacetique, acide 3-iodo-4-methylbenzoique, acide 3- (4-hexyiben- 

5 zoyl)propionique, N-hexanoyl-L-Phe-OH, adde 4-nonyloxybenzoYque, acide 4'-{trifluoromethyl)-2-biph6ny(car- 

boxylique, Bz-L-Phe-OH, N-tridecanoyl-Gly-OH. acide 3.5-bis(trifiuoromethyl)phenylacetique, acide 3- (4-n-hep- 
^tylbehMylfpTopibmi^ueTN^eptanoyl^Ph 

thrantlique. acide niflumique, acide 4-(24iydroxyhexafluoro-isopropyl)benzotque, N-myristoyl-Gly-OH, acide 

3- (4-n-octylbenzoyl)propionique, N-octanoyl-L-Phe-OH, acide 4-undecyloxybenzoique, 3-(3,4,5-trimethoxyphe- 
10 nyl)propionyl-Gly-OH, acide 8-iodonaphtoique, N-pentadecanoyl-Gly-OH. acide 4-dod6cyloxybenzoYque, N-pal- 

mitoyl-Gly-OH et N-st6aroyI-Gly-OH, 

21. Methode pour determiner la presence d*un polymorphisme de nucleotide unique dans une cible consistant en 
acide nudeique, comprenant les etapes consistant : 

15 

a) a amplifier une sequence d'une able consistant en acide nudeique contenant un polymorphisme de nu- 
deotide unique ; 

b) a engendrer une forme monocat6naire de la cible ; 

c) a associer une sonde d'acide nudeique marquee avec les molecules d'acide nudeique cible amplifie dans 
20 des conditions et pendant un temps suffisants pour permettre I'tiybridation de ladite sonde d'acide nudeique 

marquee aux molecules d'acide nudeique cible choisi amplifie complementaire, ledit marqueur presentant 
une correlation avec un polymorphisme de nucleotide unique particulier et 6tant detectable par spectrometrie 
ou potentiom^trie ; 

d) a s^parer la sonde marquee non hybrid^ de la sonde marquee hybridee par une m^thodologie de 

25 calibrage ; 

e) a diver ledit marqueur de ladite sonde ; et 

f) a detecter ledit marqueur par spectrometrie ou potentiometrie, et a detemiiner la presence dudit polymor- 
phisme de nucleotide unique, 

30 la sonde d'acide nudeique marquee ayant la stnjcture 

35 dans laquelle 

Tf"s represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogdne et fluorure, et des atomes facultatifs choisis entre I'oxygene. 
I'azote, le soufre. le phosphore et Tiode ; 
40 L represente un groupe organique qui permet a un groupement contenant "P^ unique d'etre cliv6 du reste du 

compose, le groupement contenant P"^ comprenant un groupe fonctionnel qui assure un etat de charge io- 
nisee unique lorsque le compose est soumis a une spectrometrie de masse et qui consiste en une amine 
tertiaire, une amine quatemaire ou un adde organique ; et 
X' represente un acide nudeique. 

45 

22. Methode pour determiner la presence d'un polymorphisme de nucleotide unique dans une cible consistant en 
acide nudeique, comprenant les etapes consistant 

a) a amplifier une sequence d'une cible consistant en acide nudeique contenant un polymorphisme de nu- 
so deotide unique ; 

b) a associer une amorce d'acide nudeique marquee avec les molecules d'acide nucieique cible amplifie dans 
des conditions et pendant un temps sufRsants pour permettre I'annelage de ladite amorce d'acide nudeique 
marquee aux molecules de I'acide nudeique cible choisi amplifie complementaire, I'amorce oligonudeotidique 
ayant une base en position 3' extreme complementaire de la sequence de type sauvage ou du polymorphisme 

55 de nudeotide unique, ledit marqueur presentant une correlation avec un polymorphisme de nudeotide unique 

particulier et etant detectable par spectrometrie ou potentiometrie ; 

c) d provoquer I'extensiqn de I'amorce, dans laquelle un brin complementaire de la cible est synthetise lorsque 
la base en position 3' extreme de I'amorce est complementaire de la cible ; 
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d) a separer I'amorce marquee non etendue de Tamorce marquee 6tendue parune methodologiede calibrage ; 

e) a diver ledit marqueur desdites amorces ou amorces 6tendues ; et 

f) d dStecter iedit marqueur par spectrom^trie ou potentiometrie. et d determiner d*aprds cela fa presence 
dudit polymorphisme de nucleotide unique ; 

i'amorce d'acide nucleique marquee ayant la structure 



T'""-L-X' 



10 



dans laquelle 



yms represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone. au 
moins un des elements consistant en hydrog6ne et fluorure et des atomes facultatifs choisis entre I'oxygene, 

15 I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement contenant unique d'etre dive du reste du 
compose, le groupement contenant T"^ comprenant un groupe fonctionnel qui assure un etat de charge io- 
nisee unique lorsque te compose est soumis e une spectrometrie de masse et qui consiste en une amine 
tertiaire, une amine quatemaire ou un adde organique ; et 

20 X' represente un acide nudeique. 

23. Methode pour determiner la quantite d'une molecule d'ARN spedfique dans une population d'acides nudeiques, 
comprenant les etapes consistant : 

25 a) d convertir une population d'ARN en une population d'ADNc ; 

b) a ajouter un acide nudeique monocatenaire (etalon interne) contenant une pluralite de polymorphismes de 
nudeotide unique, qui est par ailleurs identique e ladite cible d'ADNc ; 

c) e amplifier une sequence spedfique de ladite cible d'ADNc ; 

d) a co-amplifier I'etalon interne, retalon interne ayant ainsi la meme longueur que i'amplicon d'ADNc ; 
. 30 e) e engendrer une forme monocatenaire de la cible ; 

f) a associer une serie de sondes d'acide nudeique marquees avec I'ADNc cible amplifie et I'etalon interne 
amplifie dans des conditions et pendant un temps suffisants pour permettre I'hybridation de ladite sonde d'aci- 
de nucleique marquee aux sequences d'ADNc cible choisi complementaire et d'etalon interne, ledit marqueur 
presentant une correlation avec une sequence d'ADNc particuliere et un second marqueur presentant une 

35 correlation avec I'etalon interne, et etant detectable par spectrometrie ou potentiometrie ; 

g) a separer la sonde marquee non hybridee de la sonde marquee hybridee par une methodologie de calibrage 

h) e diver ledit marqueur desdites sondes ; 

i) a detector lesdits marqueurs par spectrometrie ou potentiometrie ; et 

j) d etablir le rapport du marqueur correie ^ I'ADNc au marqueur correte a I'etalon interne, et a determiner 
^0 d'apres ce rapport la quantite dudit ADNc, ce qui peniiet de determiner la quantite de TARN specifique dans 

une population d'acides nudeiques, 

ia sonde d'acide nudeique marquee ayant la structure 

45 



dans laquelle 

T"^ represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogene et fluorure, et des atomes facultatifs choisis entre I'oxygene, 
I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet ^ un groupement contenant unique d'etre dive du reste du 
compose, le groupement contenant comprenant un groupe fonctionnel qui assure un etat de charge io- 
nisee unique lorsque le compose est soumis a une spectrometrie de masse et qui consiste en une amine 
tertiaire, une amine quatemaire ou un adde organique ; et 
X' represente un adde nudeique. 
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24. Methode pour determiner la quantite d'un polymorphisme de nucleotide unique dans une cible d*acide nudeique. 
comprenant les stapes consistant : 

a) a amplifier une sequence d'une cible acide nudeique contenant un polymorphisme de nucleotide unique ; 

b) a engendrer une forme mpnocatenaire de la cible ; 

c^a assoder une sonde d'acide nudeique marquee avec les molecules d*aclde nudeique cibie ampltfie dans 

des conditions^ pendanfuFtemps 

marquee aux molecules d'acide nudeique dble choisi amplifie complementaire» ledit marqueur presentant 
une correlation avec un polymorphisme de nucleide unique particulier et etant detectable par spectrometrie 
ou potentiometrie ; 

d) a separer la sonde marquee non hybrtdee de la sonde marquee hybridee par une methodologie de 
calibrage ; 

e) a diver ledit marqueur desdttes sondes ; 

f) a detecter ledit marqueur par spectrometrie ou potentiometrie ; et 

j) a etablir le rapport du marqueur correle au polymorphisme de type sauvage au marqueur correle au poly- 
morphisme mutant et d determiner d*apres ce rapport la quantity dudit polymorphisme, 

I'adde nudeique marque ayant la structure : 

T"^-L-X' 

dans laquelle 

jms repr^sente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogene et fluorure, et des atomes facultatifs choisis entre I'oxygene, 
I'azote, le soufre, le phosphore et Tiode ; 

L repr^sente un groupe organique qui penmet a un groupement contenant unique d'etre dive du reste du 
compose, le groupement contenant T"^ comprenant un groupe fonctionnel qui assure un etat de charge io- 
nisee unique lorsque le compose est soumis d une spectrometrie de masse et qui consiste en une amine 
tertiaire, une amine quatemaire ou un acide organique ; et 
X' represente un acide nudeique. 



PatentansprCiche 

1. Verbindung der Formel: 

worin eine organische Gruppe ist, die durch Massenspektrometrie nachweisbar ist, umfassend Kohlenstoff, 
mindestens eines von WasserstofT und Fluor, und gegebenenfalls Atome, ausgewShIt aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und Jod; 

L ist eine organische Gruppe, die es einer einmaligen T^-enthaltenden Gruppe ermdglicht, von dem Rest der 
Verbindung abgespalten zu werden, wobei die T^enthaltende Gmppe eine funktionelle Gruppe umfafit, die einen 
einzelnen ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterzogen wird, 
und die tertiares Amin, quatemares Amin Oder organische Saure ist; 
X ist eine funktionelle Gruppe. ausgewahit aus Phosphoramidit und H-Phosphonat.. 

2. Verbindung nach Anspruch 1, worin X eine Phosphoramiditgruppe ist, so dad "P^-L-X die Stmktur 
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NR2 



aufweist, worin R eine Alkylgruppe Oder eine subslituierte Alkylgruppe ist, die eine oder mehrere Substituenten 
ausgewahit aus Halogen und Cyano aufweist. und die zwei R-Qruppen von NR2 kdnnen miteinander verbunden 
sein. urn eine Cydoalkylgruppe zu bilden. 

Verbindung nach Anspruch 2, worin X eine Phosphoramiditgmppe ist, so dad 7^*L-X die Stmktur 



aufweist und OR ist OCR2CH2CN, wShrend NRj N(iso-propyl)2 ist. 

Verbindung nach Anspruch 1 , worin X eine H-Phosphonatgruppe aufweist, so daft T^^-L-X die Struktur 



aufweist, worin R eine C^^-Alkylgmppe ist. 

Verbindung nacti einem der Anspruche 1 bis 4, worin eine l\1asse von 1 5 bis 1 0.000 Dalton und eine molekulare 
Formel von C^.5ooNo.iooOo.i oo^o-i 0^0-1 o^^a^p's weist, worin die Summe von a, p und 6 ausreichend ist. die 
sonst nicht gesattigten Vaienzen der N- und O-Atome zu sattigen. 

Verbindung nach den Anspruchen 1 bis 4, worin T"^ und L miteinander uber eine funktionelle Gruppe ausgewahit 
aus Amid, Ester, Ether. Amin, Sulfid, Thioester, Disulfid, Thioether. Hamstoff, Thiohamstoff, Carbamat, Thiocart)- 
amat, Schiffsche-Base, reduzierte Schiffsche-Base, Imin, Oxim, Hydrazon, Phosphat, Phosphonat. Phosphora- 
mki. Phosphonamid, Sulfonat, Sulfonamid oder Kohlenstoff-Kohlenstoff-Bindung verbunden sind. 

Verbindung nach den Anspruchen 1 bis 4, worin L ausgewahit ist aus L^^, Insure L^ase l{0] |_ir)^ Lenz Leic^ la 
und L^s, wobei jeweils aktinische Strahlung, Saure, Base, Oxidation, Reduktion, Enzym, elektrochemische, ther- 
male und Thiol-Austausch dazu fOhren. da(l die P"^-«nthaltende Gmppe von dem Rest des MolekOls abgespalten 
wird. 

Verbindung nach den Anspruchen 1 bis 4, worin L^^ die Forme! L^-L^-L^ aufweist. 

worin L2 ein molekulares Fragment ist, daft aktinische Strahlung absorbiert. um die Abspaltung von V^^ von X zu 
unterstutzen, und U und unabhangig voneinander eine direkte Bindung oder eine organische Gruppe sind, 
wobei von abtrennt und von X abtrennt und weder noch einer Bindungsspaltung unterzogen 
werden, wenn die actlnische Strahlung absorbiert. 

Verbindung nach Anspruch 8. worin -L^-L^ die Formel: 



TSSCHz-CONH— (CH2)6-0^ .O-R 



. NR2 



O 

TiE!L-0-}-0" RaNH"*" 
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15 



20 



25 



30 



aufweist, wobei ein Kohlenstoffatom an Positionen a, b, c, d oder e mit -L^-X substituiert ist und gegebenenfalls 
eine Oder mehrere der Positionen b, c, d oder e mit AlkyI, Alkoxy, Fluor, Chlor, Hydroxyl, Carboxylat Oder Amid 
substituiert ist; und Wasserstoff oder HydrocarfoyI ist. 

10. Verbindung nach Anspruch 8, worin ausgewShIt istaus einerdirekten Bindung, einem Hydrocarbylen, -OHydro*- 
carbylen und Hydrocart)ylen-(-0-hydrocarbylen)^-H und n eine ganze Zahl von .1 bis 10 isL 

11. Verbindung nach den Anspruchen 1 bis 4, worin -L-X die Formel: 




auiweist, worin eine oder mehrere der Positionen b. c, d oder e durch Wasserstoff. AlkyI, Alkoxy, Fluor, Chlor, 
Hydroxyl, Cart>oxylat oder Amid substituiert ist; ist Wasserstoff oder Hydrocarbyl, und R2 endet in einer "X"- 
35 Gruppe. 

12. Verbindung nach den Anspruchen 1 bis 4, worin V^^ die Formel: 

^ T^KJ-T^n- 

aufweist. eine organische Gruppe ist, die aus Kohienstoff und einem oder mehreren von Wasserstoff, Fluor, 
Jod. Sauerstoff, Stickstoff, Schwefel und Phosphor gebildet ist. die eine Masse von 15 bis 500 Dalton aufweist; 
T3 eine organische Gruppe ist, die aus Kohienstoff und einem oder mehreren von Wasserstoff, Fluor. Jod. Sau- 

45 erstoff. Stickstoff. Schwefel und Phosphor gebildet ist. und die eine Masse von 50 bis 1.000 Dalton aufweist; 

J ist eine direkte Verbindung oder eine funkticnelle Gmppe, ausgewahit aus Amid, Ester, Amin, Sulfid, Ether, 
Thioester, Disulfid, Thioether, Hamstoff, Thiohamstoff. Carbamat. Thiocarbamat, Schiffsche-Base, reduzierte 
Schiffsche-Base, Imin, Oxim, Hydrazon, Phosphat, Phosphonat. Phosphoramid, Phosphonamid, Sulfonat. Sulfon- 
amid oder Kohlenstoff-Kohlenstoff-Bindung; und 

50 n ist eine ganze Zahl von 1 bis 50. und wenn n grd&er als 1 ist, ist jedes von und J unabhSngig voneinander 

ausgewahit. 

13. Verbindung nach Anspruch 12. worin ausgewahit ist aus Hydrocarbyl, Hydrocarbyl-0-hydrocart)ylen, Hydro- 
carisyl-S-hydrocarbylen. Hydrocarbyl-NH-hydrocarbylen, Hydrocarbyl-arnid-hydrocarbylen, N-(Hydrocarbyl)hy- 

55 drocarbylen, N.N<Di(hydrocarby1)hydrocarbyten. Hydrocarbylacyl-hydrocarbylen. Heterocydylhydrocarbyl, worin 

das/die Heteroatom{e) ausgewahit ist/sind aus Sauerstoff, Stickstoff, Schwefel und Phosphor, substitutiertes He- 
terocydylhydrocarbyl. worin das/die Heteroatom(e) ausgewahit ist/sind aus Sauerstoff. Stickstoff. Schwefel und 
Phosphor und die Substituenten ausgewahit sind aus Hydrocarbyl, Hydrocarisyl-O-hydrocarbylen, Hydrocar- 
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byl-NH-hydrocarbylen, Hydrocarbyl-S-hydrocarbylen, N-(Hydrocarbyl)hydrocarbylen. N,N-Di(hydrocarbyl)hydro- 
carbylen und Hydrocarbylacyl-hydrocarbylen. sowie Derivative von jedem der Voranstehenden. wobei einer oder 
mehrere Waerstoffe durch eine gleiche Zahl von Fluor ersetzt ist. 

14. Verbindung nach Anspruch 12, worin P die Formal -G{R2)- aufweist. G ist C^^ Alkylen mit einem einzelnen R2 
Substiluenten, und ist ausgewahlt aus Alkyl, Alkenyl, Alkinyl. Cycloalkyl, Aryl-fusioniertes-Cycloalkyl, Cycloal- 
kehyirAryirAlBlkyirAryisubstitutiertesAikenytod 

iertes Cycloalkyl. Biaryl, Alkoxy. Aikenoxy, Alkinoxy, Aralkoxy, Aryl-substituiertes Alkenoxy oder Alkinoxy, Alky- 
lamino, Alkenylamino oder Alkinylamino. Aryl-substituiertes Alkylamino, Aryl-substituiertes Alkenylamino oder Al- 
kinylamino. Aryloxy, Arylamino, N-Alkylharnstoff-substituiertes: Alkyl, N-arylhamstoff-substituiertes Alkyl. Alkylcar- 
bonylamino-substituiertes Alkyl, Aminocart>onyl-substituiertes Alkyl, Heterocydyl, Heterocyclyl-substituiertes Al- 
kyl. Heterocyclyl-substituiertes Amino, CarboxyalkyI substituiertes Aralkyl, Oxocarbocydylfusioniertes Aryl und 
Heterocydylalkyl; Cydoalkenyl. Aryl-substituiertes Alkyl und Aralkyl. Hydroxy-substituiertes Alkyl, Alkoxy-substi- 
tuiertes Alkyl, Aralkoxy-substituiertes Alkyl, Alkoxy-substitu iertes Alkyl, Aralkoxy-substituiertes Alkyl, Amino-sub- 
stitu iertes Alkyl, (Aryl-substituiertes Alkyloxycart3onylamino)-substituiertes Alkyl, Thiol-substituiertes Alkyl, Al- 
kylsulfonyl-substituiertes Alkyl, (Hydroxy-substituiertes Alkylthio)-substituiertes Alkyl, Thioalkoxy-substituiertes Al- 
kyl. Hydrocarbylacylamino-substituiertes Alkyl, Heterocydylacylamino-substituiertes Alkyl, Hydrocariiyl-substitu- 
lertes-Heterocydylacylamino-substituiertes Alkyl, Alkylsulfonylamino-substituiertes Alkyl, Arylsulfonylamino-sub- 
stituiertes Alkyl, Morpholino-alkyl, Thiomorpholino-alkyl, Morpholinocarbonyl-substituiertes Alkyl, Thiomorpholino- 
carbonyl-substituiertes Alkyl, [N-(Alkyl, Alkenyl or Alkinyl)- oder N,N-[Dialky!, Dialkenyi. Dialklnyl oder (Alkyl, Al- 
kenyl)-amino]carbonyl-substitulertes Alkyl, Heterocydylamlnocarbonyl, Heterocylylalkylenaminocarbonyl, Hetero- 
cydylaminocarbonyl-substituiertes Alkyl, Heterocylylalkylenaminocarbonylsubstituiertes Alkyl, N,N-[Diafkyl] 
alkyienaminocarbonyl. N.N-[Dialkyl]alkylenaminocartx)nyl-substituiertes Alkyl, Alkyl-substituiertes Heterocydyl- 
carbonyl, Alkyl-substituiertes Heterocydylcarbonyl-alkyl. Carboxyl-substituiertes Alkyl, Dialkylamino-substituier- 
tes Acylaminoalkyl und AminosSure-Seitenketten, ausgewahlt aus Arginin, Asparagin, Glutamin, S-Methylcystein, 
Methionin und entsprechenden Sulfoxid- und Sulfonderivaten davon, Glydn, Leucin. Isoleucin, Allo-lsoleudn, Tert- 
Leudn, Norteucin, Phenylalanin, Tyrosin, Tryptophan, Prolin, Alanin. Omithin, Histidin. Glutamin, Valin, Threonin, 
Serin, Asparaginsaure, beta-Cyanoalanin, und Allo-Threonin; Aiinyl- und Heterocydylcarbonyl, Aminocarbonyl, 
Amido. Mono- oder DIalkylaminocarbonyl, Mono- oder DiarylaminocartK>nyl, Alkylarylaminocarbonyf, Diarylami- 
nocart>onyl. Mono- oder Diacylaminocart)onyl, aromatischem oder aliphatischem Acyl, Alkyl, gegebenenfalls sub- 
stituiert durch Substituenten, ausgewahlt aus Amino, Carboxy, Hydroxy, Mercapto, Mono- oder Dialkylamino, Mo- 
no- Oder Diarylamino, Alkylarylamino, Diarylamino, Mono- oder Diacylamino, Alkoxy, Alkenoxy, Aryloxy, Thioalk- 
oxy, Thioalkenoxy, Thioalkinoxy, Thioaryloxy und Heterocydyl. 

15. Verbindung nach Anspruch 12, die die folgende Fonnet: 




Amid 

I 

O .(CH2)c . 



aufweist, worin G (CH2)i ^ ist. wonn ein Wasserstoff an einer und nur einer der CH2-Gruppen durch -(CH2)c-Amid- 
T"^ ersetzt ist; 

■p und organische Gruppen der Formel Ci.25No.90o.9H„Fp sind. worin die Summe von a und p ausreichend 
ist, die sonst ungesdttigten Valenzen von C-, N- und O-Atomen zu sattigen; 
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O O 
, . H II 

Amidist N-C — or — C-N— ; 




ist Wasserstoff oder C^.^q AlkyI; 
10 c ist eine ganze Zahl im Bereich von 0-4; 

X ist gemad Anspruch 1 definiert; und 

n ist eine ganze Zahl im Bereich von 1 bis 50. so dali, wenn n grader als 1 ist, G, c. Amid, und unabhSngig 

voneinander ausgewahit sind. 

15 16, Verbindung nach Anspruch 15, die die Fornriel 



20 



25 



30 



35 



40 



45 



50 

aufweist, worin eine organische Gruppe der Forme! Ci.25No.90o.9H^jFp ist, worin die Summe von a und p aus- 
reichend ist, um die sonst ungesattigten Valenzen der N-und O-Atome zu sSttigen; und ein tertiares oder 
quatemares Amin oder eine organische Sdure einschllef&t; und m eine ganze Zahl im Bereich von 0<49 ist. 

55 18. Verbindung nach einem der Anspruche 16 und 17, worin -Amid-7^ ausgewahit ist aus: 



aufweist, worin eine organische Gnjppe der Formel C^.25No.90o.9H„Fp ist, worin die Summe von a und p aus- 
reichend ist, die sonst ungesattigten Valenzen der N- und O-Atome zu sSttigen und ein tertiHres oder qua- 
temares Amin oder eine organische Saure einschlielit; und m eine ganze Zahl im Bereich von 0-49 ist. 

17. Verbindung nach Anspruch 15, die die Formel 
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o , . o ^ 

(Pi-C,q) 



-NHC-(C,-C,o)-N^~^ ; -NHC-H^N ; ■ 




~^U \ 7'-(C,— Co); und — NHC-(C,-C,o)-N' 

O 



19. Verbindung nach einemderAnsprOche 16 und 17, worin -Amid-T^ ausgewShlt ist aus: 




(?,— C,o) ' (Ci-C,o) 



— CNH-(C^-C,o)-N^ y ; — CNH— (C,-Cjo>- Y"^^ 



6 o 



-^CNH-{C2-C,o)-N(C,-C,o)2: — CNH-(Cr-Cio)-N^ ^ 




— CN N(C,— C,o) ; und 



20. Verbindung nach einem der Anspruche 14-16, worin die Struktur aufweist, die sich ergibt, wenn eine der fol- 
genden organischen Sauren mit einer Amin-Gruppe kondensiert wird, urn T2-C{=0)-NCRi)- zu btlden: Ameisen- 
sdure, Esslgsdure, Proplolinsaure, Proptonsaure, RuoressigsSure, 2-Butinonsdure. Cydopropancarboxylsdure. 
Buttersaure, Methoxyesslgsaure, DifluoressigsSure, 4-Pentinonsaure, Cyclobutancarboxylsaure, 3,3-Dime- 
thylacrylsaure, Valerinsaure, N,N-Dimethylglycin, N-Formyl-Gly-OH, Ethoxyessigsaure, (Methyithio)essigsaure, 
Pyrrol-2-carboxylsaure, 3-Furoesaure, lsoxazol-5-carboxylsaure, trans-3-Hexenonsaure, Trifluoressigsaure, He- 
xanonsaure, Ac-Gly-OH, 2-Hydroxy-2-methylbuttersaure. Benzoesdure, Nicotinsaure, 2-Pyrazincarboxylsaure, 
1-Methyl-2-pyrrolcartoxy1sSure, 2-Cyclopenten-1-essigsdure, CydopentylessigsSure* (SH'V^-Pyrrolidon-S-car- 
boxylsaure, N-Methyl-L-prolin, Heptanonsaure, Ac-b-Ala-OH, 2-Ethyl-2-hydroxybuttersaure. 2-(2-Methoxyethoxy) 
essigsaure, p-ToluonsSure, 6-Methylnicotinsaure, 5-Methyl-2-pyrazincarboxylsaure, 2,5-Dimethylpyrrol-3-car- 
boxylsSure. 4-Fluorbenzoesaure. 3,5-Dimethylisoxazot-4-carboxylsaure, 3-Cyclopentylpropionsaure, Octanon- 
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saure, N,N-Dimethylbemsteinsaure, Phenylproplolinsaure, Zimtsaure, 4-Ethylbenzoesaure, p-Anlsinsaure. 
1 ,2,5-Trimethylpyrrol-3-carboxylsaure. 3-FIuor-4-methylbenzoesaure, Ac-DL-Propargylg!ycin, 3-(Trifluormethyl) 
buttersaure. 1-Piperidinpropionsaure. N-Acelylprolin, 3,5-Difluorbenzoesaure, Ac-L-Val-OH. lndol-2-carboxylsau- 
re. 2-Benzofurancarboxylsaure, Benzotriazol-5-carboxyisaure. 4-n-Propylbenzoesaure, 3-Dimethylaminobenzoe- 
s^ure, 4-Ethoxybenzoesaure. 4-{Methylthio)benzoesaure, N-(2-Furoyl)glycin, 2-{Methylthlo)nicotinsaure, 3-Fluor- 
j^rmethoxyben^zoesgure.^ 2-N apthoinsaure. ChinaldinsSure, Ac-L-lle-OH, 3-Methylinden-2-carboxyl- 

saure, 2-Chinoxalincarboxylsaure, 1 -Methylindol-2^carboxyls^ure, 2;3;6-TnnuoH5enzoesaure^ — N^Formyl- 
L-Met-OH, 2-[2-(2-Methoxyethoxy)ethoxy]essigsaure, 4-n-Butylbenzoesaure, N-Benzoylglycin, 5-FIuorindol- 
2-carboxylsaure, 4-n-Propoxybenzoesaure, 4-AcetyI-3,5-dimethyl-2-pyrTolcarboxylsaure, 3,5-Dlmethoxybenzoe- 
saure, 2.6-Dimethoxynicotinsaure. Cyclohexanpentanonsaure, 2-Naphthylesslgsaure, 4-(1H-Pyrrol-l-yi)benzoe- 
saure, Indol-S-propions^ure. m-Trifluormethylbenzoesaure, 5-Methoxyindol-2-carboxyisaure, 4-Pentylbenzoe- 
saure, Bz-b-Ala-OH, 4-Diethylaminobenzoesaure, 4-n-Butoxybenzoesaure, 3-Methyl-5-CF3-isoxazol-4-carboxyl- 
saure, (3,4-Dimethoxyphenyl)essigsaure, 4-Biphenylcarboxylsaure, Pivaloyl-Pro-OH. Octanoyl-Gly-OH, (2-Naph- 
thoxy)essigsaure, lndol-3-buttersaure, 4-(Trifluormethyl)phenylessigsaure, 5-Methoxyindol-3-e$sigsaure, 
4-Trifluormethoxy)benzoesaure, Ac-L-Phe-OH, 4-Penty!oxybenzoesaure. 2-Gly-OH, 4-Carboxy-N-(fur-2-ylme- 
thyl)pyrrolidin-2-on, 3.4-Oiethoxybenzoesdure, 2,4-Diniethyl-5-C02Et-pyrrol-3-carboxylsaure, N-(2-Fluorphenyl) 
bemsteinsaure, 3,4,5-Trimethoxybenzoesaure, N-Phenylanthranilinsaure, 3-Phenoxybenzoesaure, Nonanoyl- 
Gly-OH, 2-Phenoxypyridin-3-carboxylsaure, 2, 5-Dimethyl-1 -phenyl pyrrol-3-carboxylsaure. trans-4-(Trifluorme- 
thyl)zinnsaure, (5-Methyl-2-phenyloxazol-4-yl)essigsaure, 4-(2-CyclohexenyIoxy)benzoesaure, 5-Methoxy-2-me- 
thylindol-3-essigsaure, trans-4-Cotinincarboxylsaure, Bz-5-Aminovalerinseiure, 4-Hexyloxybenzoesaure, N-(3-Me- 
thoxyphenyl)bemsteinsaure. Z-Sar-OH, 4-(3.4-Oimethoxyphenyl)buttersSure, Ac-o-Fluor-DL-Phe-OH, N-(4-FIuor- 
phenyl)glutarminsaure, 4'-Ethyl-4-biphenylcarboxylsaure, 1 ,2.3,4-Tetrahydroacridincarboxyisaure, 3-Phenoxy- 
phenylessigsaure, N-(2,4-Difluorphenyl)bemsteinsaure, N-Decanoyl-Gly-OH, {+>-6-Methoxy-a-methyi-2-naph- 
thalinessigsaure, 3-(Trifluormethoxy)zinnaminsaure, N-Formyl-DL-Trp-OH, (RH+)-a-Methoxy-a-(trifluormethyl) 
phenyiessigsaure, Bz-DL-Leu-OH, 4-{Trifluormethoxy)phenoxyessigsaure, 4-Heptyloxybenzoesaure, 2,3,4-Tri- 
methoxyzinnaminsaure, 2,6-Dlmethoxybenzoyi-Gly-OH, 3-(3,4,5-Trimethoxyphenyl)propionsaure, 2,3,4.5,6-Pen- 
tafluorphenoxyessigsaure. N-(2.4-Difluorphenyl)glutafminsaure, N-Undecanoyl-Gly-OH. 2-(4-Fluorbenzoyl)ben- 
zoesSure, 5-Trifluormethoxyindol-2-carboxylsaure, N-(2,4-Difluorphenyl)diglycolaminsaure, Ac-L-Trp-OH, Tfa- 
L-Phenylglycin-OH, 3-Jodbenzoesaure, 3-(4-n-Pentylbenzoyl)propionsaure, 2-Phenyl-4-chinolincarboxylsaurei 
4-Octyloxybenzoesaure, Bz-L-Met-OH, 3,4,5-Triethoxybenzoesaure, N-Lauroyl-Gly-OH, 3.5-Bis(trifluormethyi) 
benzoesaure, Ac-5-Methyl-DL-Tip-OH, 2-JodphenylessigsSiure, 3-Jod-4-methylbenzoesaure, 3-(4-n-Hexylbeh- 
zoyl)propionsaure. N-Hexanoyl-L-Phe-OH, 4-Nonyloxybenzoesaure, 4 -(Trifluorfnethyl)-2-biphenylcarboxylsaure, 
Bz-L-Phe-OH, N-Tridecanoyl-Gly-OH. 3.5-Bis{trifluormethyl)phenylessigsaure, 3-(4-n-Heptylbenzoyl)propionsau- 
re, N-Hepytanoyl-L-Phe-OH, 4-DecyIoxybenzoes§ure, N-(a,a,a-Trlfluor-mtolyl)anthranillnsaure, Nifluminsaure, 
4-(2-Hydroxyhexafluorisopropyl)benzoesaure. N-Myrlstoyl-Gly-OH, 3-(4-n- Octylbenzoyl)propionsaure, N-Octa- 
noyl-L-Phe-ON, 4-Undecyloxybenzoesaure, 3-(3,4,5-Trimethoxyphenyl)propionyl-Gly-OH. 8-Jodnaphthoinsaure, 
N-Pentadecanoyl-Gly-OH, 4-Dodecyloxybenzoesaure, N-Palmitoyl-Gly-OH und N-Stearoyl-Gly-OH. 

21. Verfahren zur Bestimmung der Anwesenheit eines Single Nukleotide Polymorphismus in einem Nukleinsaureziel, 
umfassend: 

a) Amplifizieren einerSequenz eines Nukleinsaureziels, das einen Single Nukleotide Polymorphismus enthalt; 

b) Erzeugen einer EInzelstrangform des Ziels; 

c) Kombinieren einer markierten Nukleinsauresonde mit den amplifizierten Zielnukleinsauremolekulen unter 
Bedingungen und fur eine Zeitdauer, die ausreichend ist, urn eine Hybridisierung der markierten Nukleinsau- 
resonde an komplementdre amplifizierte ausgewdhlte Zielnukleinsauremolekule zu ermoglichen. wobei die 
Markienjng mit einem bestimmten Single Nukleotide Polymorphismus korrelierbar ist und durch Spektrometrie 
Oder Potentiometrie nachweisbar ist; 

d) Abtrennen von nicht hybridisierter markierter Probe von hybridisierter markierter Probe durch eine Grofien- 
bestimmungs-Methodologie; 

e) Abspaiten des Markers von der Sonde; und 

0 Nachweisen des Marlcers durch Spektrometrie Oder Potentiometrie und Bestimmen der Anwesenheit des 
Single Nukleotide Polymorphismus, wobei die markierte Nukleinsduresonde die Struktur 
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aulweist, worin T^^ eine organische Gaippe ist, die durch Massenspektrometrie nachweisbar ist, umfassend Koh- 
5 lenstoff, mindestens eines von Wasserstoff und Fluor, und gegebenenfalls Atome ausgewahit bus Sauerstoff. Stick- 
sto ff, Schwefel. Phosphor und Jod; 

L ist eine organische GruppeTdie es^eineTeinmaligeli^^^ 

Verbindung abgespalten zu warden, wobei die T^^nthaltende Gruppe eine funktlonelle Gruppe umfal^t. die einen 
einzelnen ionisierten Ladungszustand unterstutzt. wenn die Verbindung Massenspektrometrie unterzogen wird. 
10 und die tertiares Amin. quaternares Amin oder organische Saure ist; und 

X* ist Nukleinsaure. 

22. Verfahren zur Bestimmung der Anwesenheit eines Single Nukleotide Polymorphismus in einem Nukleinsaurezlel, 
umfassend: 

f5 

- a) Amplifizieren einer Sequenz eines Nukleinsaureziels« das einen. Single Nukleotide Polymorphismus enthSIt; 

b) Kombinieren eines markierten Nukleinsdureprimers mit den amplifizlerten Zielnukleinsduremolekulen unter 
Bedingungen und fur eine Zeitdauer, die ausreichend ist, urn ein Annealing des markierten Nukleinsaurepri- 

20 mers an kompiementdre amplifizierte ausgewahlte Zielnukleinsauremolekule zu ermoglichen, wobei der Oli- 

gonukleotidprimer eine zu der Wildtyp-Sequenz oder dem Single Nukleotide Polymorphismus komplementare 
auf^erst 3'-liegende Base aufweist, wobei der Marker mit einem bestimmten Single Nukleotide Polymorphis- 
mus korrelierbar ist und durch Spektrometrie oder Potentiometrie nachweisbar ist; 

25 c) Veriangem des Primers, wobei ein zu dem Ziei komplementarer Strang synthetisiert wird, wenn die am 

weitesten 3'-liegende Base des Primers mit dem Ziel komplementdr ist; 

d) Abtrennen von nicht veriangertem markierten Primer von veriangertem markierten Primer durch eine Gro- 
denbestimmungs-MethodoIogie; 

30 

e) Abspalten des Maricers von den Primem oder veridngerten Primem; und 

f) Nachweisen des Maricers durch Spektrometrie oder Potentiometrie, und Bestimmen daraus der Anwesenheit 
des Single Nukleotide Polymorphismus, 

35 

wobei der markierte NukieinsSureprimer die Struktur 

40 

aufweist, worin V^^ eine organische Gruppe ist, die durch Massenspektrometrie nachweisbar ist. umfassend Koh- 
lenstoff, mindestens eines von Wasserstoff und Fluorid. und gegebenenfalls Atome ausgewahit aus Sauerstoff, 
Stickstoff, Schwefel, Phosphor und Jod; 

L ist eine organische Gruppe, die es einer einmaligen P^-enthaltenden Gruppe ermoglicht, von dem Rest der 
45 Verbindung abgespalten zu werden, wobei die T^^nthaltende Gruppe eine funktlonelle Gruppe umfallt, die einen 
einzelnen ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterzogen wird, 
und die teriiares Amin, quaternares Amin oder organische Saure ist; und 
X' Ist Nukleinsdure. 

50 23. Verfahren zur Bestimmung der Menge eines speziflschen mRNA-MoIekuls in einer Nukleinsaurepopulation, um- 
fassend: 

a) Oberfuhren einer RNA-Population in eine cDNA-Population; 

55 b) Hinzufugen einer einzelstrSngigen Nukieinsdure (intemer Standard), die eine Vielzahl von Single Nukleotide 

Polymorphismen enthalt. die ansonsten identisch zu dem cDNA-Ziel ist; 

c) Amplifizieren einer speziflschen Sequenz des cDNA-Ziels; 
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d) Coampliftzieren des intemen Standards, wobei der interne Standard dieselt)e Lange aufweist. wie das cD- 
NA-Amplicon; 

e) Erzeugen einer einzelstrangigen Form des Ziels; 

5 

f) Kombinieren eines Sets von markierten Nukleinsauresonden mit der amplifizierten Ziel-cDNA und dem am- 

ptifizierten intemen Slandarduntef Bedingm 

mng der markierten Nukteinsauresonde an komplementdre ausgewahlte Ziet-cDNA und interne Standardse- 
quenzen zu eriauben, wobei der Marker mit einer bestimmten cDNA-Sequenz korrelierbar ist und ein zweiter 
10 Marker ist mit dem internen Standard korrelierbar. und ist durch Spektrometrie oder Potentiometrie nachweis- 

bar; 

g) Abtrennen von nicht hybridisierter marklerter Sonde von hybridisierter markierter Sonde durch eine Gro- 
llenbestimmungs-Methodologie; 

15 

h) Abspalten des Markers von den Sonden; 

i) Nachweisen der Marker durch Spektrometrie oder Potentiometrie; und 

20 j) Nehmen des Verhaltnisses von Marker, der zu cDNA korreliert ist, gegenuber Marker, der mit dem internen 

Standard korreliert ist und daraus bestimmen der Menge der cDNA. wodurch die Menge der spezifischen 
mRNA in einer NukleinsSurepopulation bestimmt wird, 

wobei die markierte Nukleinsauresonde die Struktur 

25 



aufweist, worin T^^ eine organische Gruppe ist, die durch Massenspektrometrie nachweisbar ist, umfassend Koh- 
30 lenstoff, mindestens eines von Wasserstoff und Fluor, und gegebenenfalis Atome ausgewahit aus Sauerstoff, Stick- 

stoff, Schwefel, Phosphor und Jod; 

L ist eine organische Gruppe, die es einer einmaligen *n"«-enthaltenden Gruppe ermoglicht, von dem Rest der 
Verbindung abgespalten zu werden, wobei die P^nthaltende Gruppe eine funktionelle Gruppe umfaftt. die einen 
einzelnen ionisierten Ladungszustand unterstutzt. wenn die Verbindung Massenspektrometrie unterzogen wird, 
35 und die tertiares Amin, quatemSres Amin oder organische Sdure ist; und 

X* ist Nukleinsdure. 

24. Verfahren zur Bestimmung der Menge eines Single Nukleotide Polymorphismus in einem Nukleinsdureziel. um- 
fassend: 

40 

a) Amplifizieren einer Sequenz eines Nukleinsdurezieis. das einen Single Nukleotide Polymorphismus auf- 
weist; 

b) Erzeugen einer einzelstrangigen Form des Ziels; 

45 

c) Kombinieren einer markierten Nukleinsauresonde mit den amplifizierten Zielnukleinsauremolekulen unter 
Bedingungen lind fur eine Zeitdauer, die ausreichend ist, um eine Hybridisierung der markierten Nukleinsau- 
resonde an komplementare amplifizierte ausgewahlte Zielnukleinsauremolekule zu ermoglichen, wobei der 
Marker mit einem bestimmten Single Nukleotide Polymorphismus korrelierbar ist und durch Spektrometrie 

50 Oder Potentiometrie nachweisbar ist; 

d) Abtrennen von nicht hybridisierter markierter Sonde von markierter hybridisierter Sonde durch eine Gro- 
denbestimmungs-Methodologie; 

55 e) Abspalten des Markers von den Sonden; 

f) Nachweisen der Marker durch Spektrometrie oder Potentiometrie; und 
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g) Nehmen des Verhaltnisses von mit dem Wildtyp-Polymorphismus korrelierbaren Markers zu dem mit dem 
Mulantenpolymorphismus korrelierten Markers und daraus bestimmen der Menge des Polymorphismus, 

wobei die markierte Nukleinsaure die Struktur 



aufweist, worin eine organische Gruppe ist, die durch Massenspektrometrie nachweisbar ist. umfassend Koh- 
10 lenstoff. mindestens eines von Wasserstoff und Fluor, und gegebenenfalls Atome ausgewahit aus Sauerstoff . Stick- 

sloff, Schwefel, Phosphor und Jod; 

L Ist eine organische Gruppe, die es einer einmaligen T^^-enthaftenden Gruppe ermoglicht, von dem Rest der 
Verbindung abgespalten zu werden. wobei die T^senthaltende Gruppe eine funktionelle Gruppe umfaftt, die einen 
einzelnen ionlsierten Ladungszustand unterstCitzt, wenn die Verbindung Massenspektrometrie unterzogen wird, 
^5 tind die tertiares Amin, quaternares Amin Oder organische Saure ist; und 

X* ist Nukleinsaure. 
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FIGURE 3 
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A: lOOmM dimethylodylammonium trifuoroocetote 
B: lOOmM OMOATFA/90% acetonitrile 
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A: 50mM Oimethylheplylammomum Acetate pH 6;6 
B: SOmM OMHepAA/95% ACN 

25% B to 95%B in 60 minutes 
50C 

pBR322 Haeill digest 
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A: lOOmM dimefhyllsopropylammonium ocelote 
8: lOOmM OMIPAA/90% acetonitrile 

"5%B~»FT2%B~ifr35mmutes' 
45C 

pBR322 Hceiil digest 
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A: 200mM dimethylcyclohexylammonium acetate 
B: 200mM OMCHAA/90% ocetonitrile 

II %B to 40%B in 35 minutes 
45C 

p6R322 Haeiil digest 
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A: lOOmM 1 -meihylpiperidine ocetote 
B: lOOmM mepipQc/90% ocetonitrile 

11 %B to 51%B in 40 minutes 

-50C 



lOObp ond 200bp stondords 
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A: lOOmM 1 -methylpyrroiidone 

B: lOOmM "mettiylpyrr770% ocetonitrile 

U%B to 100%B in 60 minutes 
50C 

tOObp and 200bp standards 
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FIGURE 141 
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FIGURE 21 
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FIGURE 22A 
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FIGURE 23 
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FIGURE 26 
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